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FOREWORD 


Our Living Chemistry Senes was conceived by Editor and 
Publisher to advance the newer kno%vledge of chemical medicine 
in the cause of clinical practice The interdependence of chemistry 
and medicine is so great that physicians are turning to chemistry, 
and chemists to medicine in order to understand the underlying 
basis of life processes in health and disease Once chemical truths, 
proofs and convictions become sound foundations for clinical 
phenomena, key hybnd investigators clanfy the bewildering pan¬ 
orama of biochemical progress for apphcauon m everyday practice, 
stimulation of experimental research, and extension of postgrad¬ 
uate instruction Each of our monographs thus unravels the chem¬ 
ical mechamsms and clinical management of many diseases that 
have remained relatively static m the minds of medical men for 
three thousand yean Our new Senes is charged with the msus elan 
of chemical wisdom, supreme in choice of international authors, 
optimal in standards of chemical scholarship, provocative in 
imagination for experimental research, comprehensive in discus¬ 
sions of scientific medicine, and authontative m chemical perspec¬ 
tive of human disorders 

Dr Mason and Mr Bulgren present a fascinating treatise on 
the pnnciplcs and practice of the newer analog and stored pro¬ 
gram digital computers in clinical medicine, medical research and 
hospital administration It is medically applied cybernetics, the 
new science that emerged from the mechanistic materialists of the 
eighteenth century and their modem counterparts The purpose 
of constructing a conceptual world in terms of machine analogues 
is to unravel the mechanism of life in health and disease The 
methods not only yield vital knowledge by a rapid process of piling 
up facts but cardinal wisdom in their simplification through 
effective integration of heterogeneous factors This great concur¬ 
rence between learning and insight via electronic computers marks 
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a new era m quantitative medical thinking that comes out of the 
qualitative advances of the past Yet the automation js as natural 
a growth as an oak emei^ng from current developments in bio¬ 
physics, medical engineering and biomedical computing 

The high speed electronic computer is a new tool, different 
from any ever created, for extending the scientist’s mental activity 
in problem analysis, concept exploration and clinical diagnosis 
Small signals effect large outputs of medical information involving 
extensive data, mulutudinal factors and mathematical manipula¬ 
tions Accumulated knowledge is stored electrically, correlated 
promptly and analyzed automatically by planned programming 
to provide prompt solutions to pressing problems without waiting 
and hoping The computer is a model of mneteenth century 
predestination, devised by Boole and Babbage, English mathe¬ 
maticians, in 1822 to relieve man of the drudgery of computing 
It provides all the basic theoretical and mechanical knowledge 
necessary for a blueprint of the modem giant machines, lacking 
only the thermionic tube to give it the speed of an electron instead 
of a piston This transcendent capacity to make reason sublime 
lay dormant over a century before American scientists began to 
create models of functioning living creatures in search of a deus 
ex machxna 

AU compuung automata fall into two great classes well repre¬ 
sented m the central nervous system The digital, which counts 
elcctnc pulses like beads on an abacus, and the Analog, which 
measures continuous variables like scale lengths on a slide rule 
The former is more precise, the latter faster, but both must be 
programmed, i c , instructed to cany out the desired computation 
Digital computers developed on the pnnciple of desk calculators 
perform mathematical computations heretofore unavailable be¬ 
cause of the great time required for the calculations and the 
relative shortness of human life Significant is the remarkable 
ability of the computer to store information in the form of punched 
or magnetic tape as a permanent medical rnemory for the diagnosis 
and treatment of every human disease Such long-range develop 
ment of a universal tool for the physician will accelerate the 
ultimate goal of clinical medicine Pn^amimng a problem requires 
several weeks, the calculations several minutes m comparison with 
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hand computations that require several years Analog computers 
differ from digital systems as continuous functions differ from the 
discontinuous in mathematics They represent physical conditions 
by voltages and currents rather than by numencal means enabling 
satisfactory investigations of complex problems with many poten¬ 
tial hypotheses via twenty operational amplifiers The authors’ 
experientta docet^ Experto credtte 

Natural systems are of such enormous complexity as to neces¬ 
sitate subdivision of the problem into elementary umts The 
end-result of medical research on a system is to learn enough 
about it to describe it in mathematical equations and then resort 
to computers for the solution The human mind is a marvelous 
computer with 10*® computing dements but limited in speed and 
afflicted with forgetfulness, both corrected by man made com¬ 
puters of less flexibility and little creativity with 10* to 10* com- 
putmg umts The computers can only function m harmony with 
an active human mind, perfonning manipulations according to 
the instructions of the operator unless created with a prodigious 
10* parts Electronic computers arc thus endowing material forces 
with intellectual life without degrading human life into a matenal 
force Newer techmques of molecular mimatunzation and current 
advances in computer organization may yet mime the higher 
mental processes Beigson considered mtelhgencc in its ongmal 
feature as the faculty of manufactunng artificial objects, especially 
tools to make tools, and of indefimtdy stimulating the creation 
of mcreasmgly complex machinery CSiaucer thought that “A man 
hath sapiences thre, Memone, Engin and Intellect also ” But 
Goethe warned. 
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Chapter I 


COMPUTERS ARE TOOLS 


Automation in industry and the use of computers in 
hospital administration and in medical research have stimulated 
a great deal of interest and perhaps some unjustified anxiety about 
automation and its effect on medical practice Installation and 
use of equipment is gaming momentum in hospitals, medical 
schools and research laboratories This book wll attempt to out- 
hne some of the problems that exist, some of the things that have 
been accomplished and some of the current trends toward more 
universally applicable solutions For those Nvho still have doubts 
about the ultunate use of automatic data processing m medicine 
or who fear the effect of these techniques on the practice of medi- 
ane it is hoped that the following discussion nviU show the potential 
savings in time, increase m efficiency and a new type of freedom 
which these new tools offer 

The modem physician is so busy doing routine things ivbich a 
machine could do that he does not at present have enough time 
for the personal attention that his patients used to receive Valuable 
physiologic and biochemical laboratory studies can be automated 
in such a way that considerable analysis and data reduction takes 
place and information is retrieved by the physician m much more 
useful and understandable form Automatic data processing tech¬ 
niques can provide part of the ans%ver to many of the complex 
and challenging problems in medicine 

The techniques wll require hard thinking about objectives, how 
man functions and how the many routine activities could be auto¬ 
mated This means a new order of magnitude of learning and a 
creation of jobs with ne>v purposes, titles and perhaps new depart 
ments in our medical schools There are problems in medicine 
3 
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that may require special techniques but most medical problems 
are made up of the same type of questions that are asked m other 
disciplines Problem solving is an activity of mankind and phy 
sicians function in much the same way that other men function 

What is a computer’ The builders of computers have a much 
different answer to this question than the users of computers 
Occasional clinicians, because of their background or interest, may 
become well versed in the electronic details of how computers 
function and may be able to contribute to the further develop¬ 
ment of equipment which will have special advantages to applica¬ 
tions in clinical problems Such crossing-over benveen disciphnes 
is to be encouraged For the majority of clinicians, medical inves¬ 
tigators and even basic research workers, the computer must 
remain as a black box The use of an automobile is an analogy 
familiar to all The dnier may derive almost as great a benefit 
from knowing only how to start, stop and guide the machine as 
does the garage man uho also takes care of it or the engmeer who 
also designs it 

At tunes a computer definition must include some information 
as to how It IS being used For example, a pencil would not be 
called a computer if it were used m its usual fashion for makmg 
marks on paper A pencil might be considered an analog com¬ 
puter, however, when it is held up at arm’s length by an artist 
in order to estimate the various dimensions of the landscape so 
that they might be transferred to paper in their proper proportions 
Such use of a pencil as an analc^ computer would not be con¬ 
sidered misuse of the penal since it is efficient and appropnate 
to use the pencil first as an analc^ and then in drawing the picture 
If primitive man \vere given a box of pencils with no information 
as to what they were or how they could be used, he might use 
these as tally sticks, taking one out of the box each tunc one of 
his “domesticated” animals passed into the fields in the mormng 
and replacing one pencil in the box as each animal returned m 
the cvemng Used m this wa>, the box of pencils would become a 
digital computer and since stones or crude pieces of wood could 
be used as well, it is rather obvious that the box of pencils used in 
this ivay would be inappropnatc and extravagant 

The slide rule is an analog computer which can be used for 
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multiplication, division, and even more complicated manipula* 
tions The %vell versed user may set up an equation containing a 
number of mampulations and arrange them in such a way that 
the information is maintained on the slide rule until the final 
answer is obtained With more complicated problems, it is neces¬ 
sary to incorporate a pencil-and-paper recording of the partial 
results and to work back and forth bctiveen the slide rule (which 
can be considered the arithmetic umt) and the pencil and paper 
which can be considered storage, part of input-output, and con¬ 
trol If greater accuracy is desired, but mechanical assistance 
seems desirable, the next step in complexity would be to use a 
desk adding machine or a desk calculator Here again, part of the 
solution may be maintained within the calculator between various 
steps of the problem, but the paper and pencil are still often needed 
The research worker who is analyzing curves and desuous of 
integrating might add to his equipment a simple planimcter which 
IS also a computer and capable of determining the area within 
any regular or irregular curvilinear space If a plammetcr is not 
available and a number of curves are to be compared, the worker 
may draw the curves on standard graph paper, cut them out, and 
weigh the cut-out portions of the curves in order to obtain an 
integration of the area beneath each curve The iveight of the 
paper is a function of the migrated curve Each age has had its 
mathematicians and its makers of mathematical machines The 
modem computer is a relati\ely recent development 

The modem analog computer is an ejqjansion of original com¬ 
putational schemes such as the slide rule or the planimeter but 
is capable of much greater variety of uses and far more complex 
problems It is more versatile by virtue of electronic parts instead 
of mechanical pans The modem digital computer has shown the 
greatest development and the greatest promise of all computing 
mschsiscs because a/ fbe concept of a sfored pn^am 
The entire telephone system is a big digital computer with 
which, by successive dialings, we eliminate large groups of phones 
until, with the last dialed number, a single address is selected and 
we then listen to a restricted segment of information, a window 
if you please, which limits the available signal The signal ob¬ 
tained may be a recorded human voice, programmed to play m 
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proper relationship to real tune so that the interrogator may leam 
the time of day By having the necessary local equipment and a 
phone book we participate in a system much more complex and 
expensive than we as individuals require or could afford Those 
subscribers with larger needs may buy more service in the com 
puting system or more and greater variety of peripheral equipment 
or both 

A modem stored program digital computer makes use of the 
telephomc techmque of address points and stored information at 
these points The stored information may be parts of tables, data 
that is as yet unprocessed, statistical data that has resulted from 
partial or complete analysis or instructional material that is part 
of the stored computer program The computer is a step away 
from telephoning a machine to leam the time of day in that the 
computer is programmed to talk to itself, shift data around, work 
with It m a prescnbed manner and, at various points along the 
way, to report out intermediate results until at the end of the 
program final answers are ready for our perusal 

liie analog computer, as its name indicates, uses an electncal 
signal such as voltage which is measured as an analog of some 
physical or measurable quantity under study For addiUon and 
multiplication a circuit is designed with adding units which feed 
in the appropnate additional amount of voltage or an amplifier 
which multiplies the voltage by the required amount Capacitors 
are used to accumulate the voltage over time as an analog of 
integration A capacitor is analogous to cutting out amd weighing 
a graph of a concentration/timc curve If several equations are to 
be solved simultaneously, the circuits can be so arranged 

Such equipment is useful for small to moderate sized problems 
The equipment is relativdy low m cost and fairly easy to assemble 
Rearrangement of the parts can be earned out with a winng 
plugboard Coefficients m the equations can be altered by the 
use of vanable condensers The equipment can be made available 
right in the laboratory If the research worker wishes to analyze 
a curve without mathematically dcnvmg the equations involved, 
he may be able to accomplish this by manipulauon of the dials 
on an analog computer and obtain a good enough fit for most 
purposes m biological wcK-k The analog computer is useful on line 
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in the expenmental procedure if the expenment has to do wth 
electrical impulses, and at times a partial analysis may be used 
to provide an appropnate stimulus as a part of the expenment 

The analog computer can be of tremendous help m teaching 
the applications of mathematics in biology as it is in other fields 
The analog computer may be designed for on*hne simulation as 
a model of the expenment so that repeated adjustments can be 
made in the computer until the results correspond to what is 
observed m the experimental subject Thus, the accuracy of the 
analog which is under investigation in relation to the expenmental 
subject ctm be rapidly checked and modified Such an arrangement 
IS very useful m the development of hypotheses as to how physio¬ 
logic control mechanisms seem to function in the body It is 
possible m this manner to investigate moderately complex prob¬ 
lems where dozens of minor hypotheses must be formulated and 
tested Such testing by adjustment of an analog computer may be 
more economical than modifying programs for a digital computer 

The digital computer can do practically anything that the 
analog computer can do It has been said that if problems require 
a large analog computer, it may often be more economic^ to 
simulate the large analog computer on a stored program digital 
computer than to actually construct the analog computer I^b- 
lems which can initially be solved satisfactorily with an analog 
computer may grow m numbers of simultaneous equations re- 
quinng solution until it becomes necessary to transfer the work 
to a digital computer 

The analog computer has a limited accuracy whereas the digital 
computer can be programmed to develop answers to any desired 
number of decimal places It is difficult to classify analog and 
digital computers as to the types of problems for which they should 
be used It would seem, however, that the digital computer is 
much more versatile and can be used not only for arithmetic 
problem solving, but is easier to program for the solution of 
problems involving many logical decisions This latter type of 
problem is more common m clmical research where the desire 
is to store and retrieve data, to rearrange data in different cate¬ 
gories, and to study relationships of the incidence of signs and 
symptoms in relation to certain diagnoses The digital computer 
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may make it easier to use a combination of logical and mathe¬ 
matical decisions 

The chief advantage and advancement of the modem digital 
computer rests in its facility for storing the entire program wjthin 
the computer Stored program digital computers have a number 
of black boxes designed for input, output, control of the operations 
and storage of data and reference material with varying degrees 
of availability There are two equally important divisions of com¬ 
puter equipment into hardtiare (or the actual physical computer 
equipment) and so-called software (the programs which are to be 
stored in the computer and which instruct the hardware m the 
sequence of steps required for solution of a given problem) 

Tile stored program is made up of a scries of words These words 
are m numerical form or so-called machine language Each u-ord 
includes an instruction as to the step which is to be earned out, 
such as addition or subtraction, or some logical step such as 
determining which line of further logic is to be followed The 
word also includes an address ivhcrc the information is stored 
which IS to be acted upon at this step m the program and finally 
a second address to which the machine is to turn for the next 
instruction Thus, one instruction follows another 

Data are read in from the input section or brought in from 
various storage areas, acted upon, rearranged, re-stored m other 
addresses for later use m the sequence of analysis Comparisons 
are made, tables are searched m the memory, long senes of sinular 
numbers arc summed and perhaps anthmetic means and standard 
deviations and other moments of higher order arc calculated and 
stored 

Finally the results are fed out unto magneue tape, or punched 
cards to be printed, or graphed by ancillary equipment, for 
inspection by the research worker It is obvious that to make 
use of the computer, the investigator must know what he wishes 
coniputod and must have a very detailed knowledge of the steps 
which are required to carry out this computation by hand, even 
though the actual labor of such an analysis might make the hand 
analysis impossible 

The first digital computer, consisting of the digits on the h^nds 
and feet, predisposed man to the use of a decimal system of count- 
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mg Digztal computers actually make use of a binary system in 
which the decimal system can be simulated Even the digital 
computer is made up of electronic components such as transistors, 
relays, vacuum tubes, Ferrite memory cores, and other electncal 
components which have basically t\%o stable states 

It takes about three and one-third times as many characters to 
represent a given magnitude in the binary number system as 
compared 'vith the decimal system Letters can be coded and 
manipulated in the digital computer so that the research worker 
may make use of Boolean algebra or some other system of symbolic 
logic The versatility of the digital computer is thus limited solely 
by the ingenuity of the user in defining his problem and in arrang¬ 
ing the detailed steps or lai^er components of library programs 
and subroutines for the solution of his particular problem 
The power of the tool denves from the accumulation of soft¬ 
ware as well as from the speed, memory capacity, and versatility 
of the physical equipment The tremendous potentialities are very 
seldom appropnately matched by the problems presented This 
IS inevitable The equipment and software are the product of 
thousands of experts using the accumulated knowledge of genera¬ 
tions of the most bnlhant problem solvers The chief difficulty 
today is in formulating problems worthy of such equipment and 
rcstncting computer use to those problems that really need so 
much automatic data processing 

Most physicians will have speafic problems of a more limited 
scope than general problem solving, diagnosis and literature re¬ 
trieval The requirements of effiaent computer use are (1) the 
presence and recogmtion of the problem, (2) an approach to 
solution of the problem including certain stated hypotheses and 
methods of testing these hypotheses, and (3) translation of the 
testing procedure into machine language All of man’s activity 
involves sampling and data reduction Our senses record only a 
very limited amount of the available data around us The temp¬ 
tation when confronted with something new and exciting is to 
ovenndulge 

It is necessary, first, to recogmze the existence of a problem 
which could be solved better by machine methods than by the 
direct application of pencil and paper and the inappropriate use 
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of human brampowcr Computers do not replace thinking They 
demand deeper thought They permit the thoughtful detailed 
sequence of logical steps used in solution of a problem to be used 
repeatedly without repeating the same human expenditure of effort 
Many different problems require parts of the same thinking 
Problem solving has rcpetiuous aspects at all levels and this is 
what allows the use of programmed machines 
Problems range from those of how to get from A to B (so-called 
applied research, or development, or just simple business pro¬ 
cedure) to those problems concerned with defining A and/or B 
(basic research) The problem may be simply one of facilitation of 
some activity now earned out by hand which could be automated 
in order to save time and money or in order to continue the 
service in wew of its expanding proportions Thus, an admini 
strator may wish to automate a detailed billing procedure which 
will inform each recipient of the services and materials which he 
has received An electrocardiologist may wish to provide an ECG 
report of a very complicated mathematical and statistical analysis 
winch IS of known reliability and which tvould require hours or 
days if carried out by hand A psychologist may wish to provide 
hundreds of reports on standardized personality and intelligence 
tests so that most patients seen m an institution would have at 
least these basic test results for use by the medical staff An in 
temist might ivish that all patients seen in his diagnosuc clmic 
have an array of standard questions answered for machine analysis 
before the patient is actually interviewed by his physician This 
might take the place of much of the so called “systemic review ’ 
and might allow faster and more complete review of items which 
are not immediately pertinent to the chief complaint and which 
might otherwise be neglected in some patients These are all 
questions of getting from A to B They are relatively simple to 
program 

The second broad category of problems is the use of automatic 
data processing to help find out something about A and B These 
are questions that come up m the counc of research and investi¬ 
gation at all levels To be practical, it is necessary to link such 
quesUons closely to their proposed means of solution It does no 
good to ask a quesuon that has no detectable or potential method 
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of solution A problem must be defined in sufficiently clear and 
concise terms so that hypotheses may be developed and appro- 
pnate observations and manipulations carried out m order to seek 
solution This is the most difficult area at present in the application 
of automatic data processing m medicine This is where much 
heat IS generated This is where the art of medicine is championed 
and the invasion of machines decried This is actually the point 
at which the living, self-reproducing computer finds he must “fish, 
cut bait, or go ashore ” This is where the physician finds that he 
needs to know more about his problem and problem solving, 
including vanous mathematical and statistical techniques This 
IS where the statistician, if he wishes to tvork in the field of medical 
application, needs to know much more about medical problems 
This is where the mathematician who wishes to help must be 
practical and attempt to apply his theoretical knowledge to the 
problem at hand This is where it becomes evident that each man 
18 dependent upon his fellow men and that they, m turn, are 
dependent upon him Problems under such close scrutiny often 
become very personal and bring out the deficiencies of each and 
every one of us but also offer the challenges that mankind seems 
to live for 

Having “staked out” a problem which seems amenable to auto¬ 
matic data processing, one or more hypotheses must be stated 
An hypothesis is a statement of certain relationships which must 
then be tested by cxpenraentation and analysis of the resulting 
data There may be many separate hypotheses about a problem, 
since each of these is usually concerned with some detail For 
example, the problem might be one of how to diagnose more 
accurately and earlier and more easily a group of related diseases 
The specific hypotheses, then, ivould be concerned with the 
relationships of signs and symptoms in these diseases to the diag¬ 
nosis of certain sub-classifications of diseases or specific diseases 
The testing of a h>pothesis may actually re%eal the need for an 
increased scope of the problem statement rather than an im¬ 
mediate attempt at solution 

As the rigor of the investigator’s approach to the problem 
increases, as the questioner comes closer and closer to the basic 
facts of the problem with which he is working, there are usually 
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recurring loops of thinking which refer back to the statement of 
the onginal subsidiary hypotheses or of the entire problem Often 
the worker finds, m the course of trying to state his problem, 
that he did not really understand the problem in the first place 
This is the real power of automatic data processing and a theme 
^vhlch deserves some repetition—namely, that machine methods 
of solvmg problems demand a level of thinking, an intensity of 
thinking, a strictness of semantic activity which is often a revelation 
to the computer neophyte but which is necessary before any real 
progress can be made with machine use 

There is a continual reworking of the questions, going from 
very general statements to statements that are more and more 
specific This IS assisted by flow charting the actual steps m the 
solution of the problem A flow chart consists of a series of dia 
grammatic blocks, each of uhich contains a statement as to what 
step IS to be taken or what observation made, or what set of 
calculations carried out before the next succeeding step can be 
taken 

The initial flow chart may consist of only two or three blocks 
in the diagram According to the problem, each of these blocks 
must then be worked out m greater detail until, finally, a senes 
of specific logical and/or mathematical equations can be wntten 
In the early days of computer use, it was necessary to carry such 
flow charting and programming all the way to machine language 
This IS no longer necessary since there arc no\v available special 
computer translation programs which allow the programer or 
research worker to write out his proposed program m an under 
standable sentence structure, using almost normal English words 
Providing certain rules are followed, the computer programed 
with a translation program will then convert the recognizable 
human statements into numerical machine language In addition 
there are available many sub routines so that if the worker, for 
example, wishes to obtain a square root of a number, he can 
simply incorporate “the square root sub routine ’ with all of its 
machine language at the specified point in the analysis Many 
powerful sub routines have been incorporated in laiger routines 
which, in turn, may carry out a whole complicated statistical 
analysis at an appropnate place in the desired program Thus 
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for example, the research worker may Nvish to determine the 
simple correlations between several variables and might incor¬ 
porate the library program for correlation analysis into his program 

For the experienced programer with a library of these sub¬ 
routines and special programs, these powerful tools can be as¬ 
sembled simply by the appropriate writing out of a few brief 
statements It is thus possible to make use of the accumulated 
knowledge and skills of the most gifted mathematicians and 
statisticians of our time and of previous generations plus the 
knowledge and skills of thousands of electronic engineers and 
other specially trained people whose vast efforts have been called 
together and assembled m a specific program on a given computer 
to answer your question 

The translation and assembly programs which have been written 
for vanous computers include many automauc methods of recog¬ 
nizing common errors in writing programs so that after the pro¬ 
gram IS written in recognizable symbols and run into the com¬ 
puter for translation into machine language, there is an immediate 
return of questions from the programed computer calling attention 
to a vanety of mistakes in the way in which the program is as¬ 
sembled 

Many problems do not require automatic data processing Some 
problems could better be earned out with a set of hand-sort cards 
or possibly with punched cards to be sorted by machine Some 
problems are best suited to small analogue computers Others 
require a digital computer ^vlth a stored program Some problems 
have a great deal of data which requires efficient access to the 
computer and large storage areas but relatively little actual com¬ 
puting Other problems require very little data to be read into 
the computer, but require lengthy computations before the 
answers are pnnted In most instances, the actual computations 
are so fast that the limitation in efficiency depends upon the rate 
at which information can be read into the computer and read 
out again 

A problem may have se%reral alternative mathematical ap¬ 
proaches which make it more economical to use the larger com¬ 
puter so that the various methods can be tried Then, after the 
best approach is found, a special purpose computer can be used 
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for subsequent repetitive analyses using the approach already 
worked out with the more elaborate equipment In some instances 
computers must be available right m the laboratory where the 
research work is being done Some of these installations must be 
so called “on-line” so that computations may be earned out 
during the experiment and with some of the results used to, in 
turn, modify the experiment, a transponder function The actual 
equipment used for solution of specific problems will, of course, 
often depend upon what equipment is available to the research 
worker 

The cost of data processing with various available types of 
equipment must be taken into consideration Someone may need 
to decide upon the relative value of the project in relation to 
other demands on the equipment There may be facilities of hard¬ 
ware or sofhvare available elsewhere which would make it more 
economical to do the data processing even outside of the original 
institution in spite of there being equipment locally available 

With expanding facilities and expanding problems, it has be* 
come evident that a variety of installations will be necessary and 
that some of these could most efficiently be ranged m a hierarchal 
and compatible fashion so that small computers can be used in 
laboratones or hospitals up to a point and then, when larger 
facilities are needed, the problem can spill over into the next 
larger installation It is inevitable that some problems will require 
the use of large regional data processing centers and computers 
such as those which have been established for national defense 
Tliere are expensive special purpose installations which should 
be available to investigators who would travel to the installation 
for a few weeks or months in order that such equipment not be 
wastefully installed for only brief requirements 

A potential user of computers should take warning that it is 
not wise to attempt to store all data for all possible problems for 
an indefinitely long period of time in order that any future research 
worker with a brilliant idea may rc-sift all of this data in the 
expectation of obtaining the specific information required to prove 
or disprove his hypotheses This approach is doomed to failure 
Every line has an infinite number of points It is possible, there- 
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fore, on the most imnute problOTi to assemble more data than our 
best computing equipment can handle 

It has been estimated that any problem which requires more 
than 10** manipulations cannot be handled by present day com¬ 
puters Even this number of computations would require at least 
three years on our present fastest equipment The clinician would 
like to store all of the information tliat is on a patient’s chart, 
but all of the information that is on a patient’s chart is only a 
very small part of the data which are available about that patient 
So, even if he could store all data from all charts on all patients, 
this still ^vould be an abstract 

The most common approach to clinical research, the path of 
least resistance, has been to collect a large amount of data or as 
much data as can be handled and then to survey such data in 
all of their permutations, combinations and statistics to see if there 
are any striking relationships which might signify some new insight 
into the overall problem The general idea too often seems to be 
to collect a lot of data and then to do a retrospective study picking 
up hypotheses along the way This approach implies that the data 
will be collected ivithout knoNvmg what questions are to be asked 
Such an approach to be successful must theoretically include the 
greatest amount of data that can possibly be stored 

There are certainly some key data, such as patient age and 
sex, which arc worthivhile collecting in almost any clinical inves¬ 
tigation and ivhich will probably be necessary for retrospective 
studies if and when hypotheses are stated There is no reason why 
such retrospective studies should not occasionally be continued 
and should not occasionally be productive However, the common- 
sense approach and the scientific method requires an initial stating 
of some hypotheses and establishing a protocol for collection of 
data in a prospecti\c study hater on, neu’ly formuJafed retro¬ 
spective studies may still be made wth this data 

At least, if one starts out in the prospective mode, there will be 
better collection of those data which are included in the protocol, 
and there is a better chance the data will be useful It is, of course, 
an oft’ repeated truism that no information can be extracted from 
data that is not intnnsically present in the data when it is originally 
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obtained as a sample It js equally true that when large amounts 
of data are analyzed from a good protocol that the techniques 
of analysis may bring out information that would not otherwise 
have been observed, just as a pH meter, spectrophotometer or 
electrocardiograph reveals relationships which could not be eval¬ 
uated with the unaided senses 

As soon as the investigator accepts the fact that he cannot study 
the universe and that most of what he studies is a relatively small 
sample, he may then begin to define that part of the universe m 
which he is interested He may wish then to define what a random 
sample would be for a selected portion of the universe Sampling 
will depend, to some extent, upon the type of expenment and also 
on the analytical techniques which arc available for the analysis 
of the data Attention should be paid to various control conditions 
which should be met m order that the study is confined to the 
area of declared interest 

A number of parameters are selected lor measurement and per¬ 
haps other observations of a non-parametne nature [without a 
scale] are also recorded Environmental conditions should often 
be included The expenmental design should be considered with 
great care and the most important dimensions formulated in such 
a way that they can be separated out m the final analysis The 
cost of the sample must be considered along with the size If the 
cost is too great, perhaps one dimension can be eliminated by 
restncting the experiment to a limited age group or a limited 
sex or certain limited conditions of the experiment The cost must 
be considered in relation to the desired accuracy of the results 

The research worker is often interested m studying a particular 
causc-and-effcct relationship and suppressing those interfering 
forms of random activity, error and noise which obscure the signal 
in which he is interested When the expected result, or the signal 
of interest, is not pronounced, it may be desirable to use repeated 
observations or repeated subjects under similar conditions m order 
to average out the interfering noise of sampling, analytical error, 
random variation, background noise, etc 

The research worker who has a problem which allows him to 
sample a very precise curvilinear relationship with little detectable 
vanation from experiment to experiment is fortunate He may 
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then go directly to curve analysis and the interpretation of the 
observed relationships in terms of specific models In much of 
the medical research today, the observations are not so concise 
The data with which we work requires statistical treatment before 
we can detect the actual relationships 

It IS necessary for the worker to understand sufficiently statistical 
methods so that he may decide upon the risks that he wishes to 
take in calling a result significant when it really is not sigmficant 
or, on the other hand, in saying that there are no significant 
differences between two groups of data when, in actuality, there 
may be a true difference These two types of error are always 
present Anything done to minimize one type of error automatically 
increases the other 

Finally, the requirements for efficient use of automatic data 
processing should at least include some vague idea before the 
expenment is undertaken as to the interpretation and possible 
generalization which will be made when the expenment is finished 
and the data is analyzed The best laid plans would allow the 
research worker to write out his conclusions before the expenment 
is undertaken and then, at the end, to simply pick up the discussion 
which best applies to the results obtained Actually, our hypotheses 
are seldom so precise and our ideas as to the model under study 
are seldom so accurate as to predict the alternative results to this 
extent Even when the results do coincide with some anticipated 
formulation, the mere perusal of these results stimulates one’s 
thinking to the extent that new conclusions and interpretations 
become evident that had not heretofore been considered 

There is, or should be a great deal of education for the investi¬ 
gator at all stages of his work The hypothesis and the onginal 
problem may need to be revised at any stage of the program 
development, its analysis, or after obtaining the final results of 
the computer analysis Any research project, and the use of auto¬ 
matic data processing as well, can break down completely at any 
stage if the human component ceases to function (think) It is 
human nature to expend the greatest effort m getting the program 
to work and in obtaining the data for analysis, and then to relax 
(cease to think) when the pnnt-out is obtained Computers are 
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programmed to make decisions, to do routine thinking, but not 
truly original thinking 

One of the first duUes after obtaining the computer prmt-out 
IS to check the results for tlieir consistency, validity and possible 
hidden meamng The computer is a “dumb brute” which will 
turn out volumes of stupid mistakes if it is so programed, or if 
inaccurate and inappropriate data are fed into it The computer 
technologists have built in many checks and balances which take 
care of most of the possible errors that might occur as a result of 
the actual equipment The standard translaUon programs which 
have been written for translation of newly written sets of instruc¬ 
tions to machine language programs and many of the library 
programs for special analyses have written into them many checks 
which will prevent computauonal and common logic errors from 
creeping into the analysis There is nothing in the programed 
computer, howe\er, tliat will prevent the mvesUgator from allow¬ 
ing errors to creep into his data or even ivorse, from attempung 
to solve a poorly formulated problem by an inappropnatc method 
Good programs pnnt out not only the final results of the analyses, 
but also enough of the intermediate calculauons and conclusions 
80 that the investigator may check the reasonableness, consistency 
and validity of the over-all analysis It is helpful to have the 
automatic data processing equipment hooked up with appropriate 
ancillary equipment to pnnt some of these intermediary results 
in graphic form or to actually take the pnntcd sheets and graph 
the intermediary sets by hand It is possible thereby to pick out 
completely wild values and to go back to earlier steps in the 
program or to original data and find the errors that have been 

overlooked , 

Finally, if all is well, the actual final equations or statistical 
tables mmt be studied so that valid conclusions can be dratvn, 
and appropnatc action taken, depending upon whether the prob 
lem has been one of a business procedure, basic researeh, clinical 
research, or otherwise Some practical application of the result 
may then follow such as the sending of thousands of itemized 
hospital bills to pauents, or a rearrangement of certam faciliues 
in Ac hospital for optimum use It the program has been one for 
basic researeh Ae investigator may then decide it his hypoAcsis 
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IS supported or false and may proceed to develop new hypotheses 
about the problem or perhaps turn to some new problem 
The secret to success rests in having a ^veIl formulated concise, 
answerable question in the first place Automatic data processing 
without this leads to expensne processed data and nothing more 
Large eunounts of data can be boiled down into powerful answers 
only to the extent that many hours of logical thinking about a 
problem have led to a good computer program There is a direct 
relationship between time spent on statement of the problem and 
time saved in data reduction 
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HOSPITAL ADMINISTRATION 
AND OPERATIONS RESEARCH 

The focal point around which all hospital problems in auto¬ 
matic data processing should focus is the patient, who must be 
well cared for so that he will receive maximum medical benefit 
and who must be satisfied so that he will look upon the hospital 
as an efficient and attractive place to which he would be willing 
to return at any time that he is advised to have further treatment 

The hospital administrator may look upon his job as one of 
regulating a large neuvork of loterdigitaung traffic of patients, 
personnel and materials He must be prepared to satisfy demand 
without costly excesses of supply The problem is a continually 
changing one but to a certain extent he is able to predict the 
changes before they occur if he has information about the national, 
regional and local changes m medical practice and hospital care. 

Each hospital has a certain obligation to contribute to this 
over-all fund of knoivlcdge This is usually fairly easy m situations 
where the internal workings are well studied The administrator 
must be prepared to alter hospital facilities in relationship to the 
appearance of new special purpose hospitals in the area He must 
understand some of the epidemiology of diseases and the effects 
of aging of the population on the types of care that must be pro¬ 
vided Innovations m treatment and changes in attitude of patients 
and physicians as well as third party influences such as insurance 
plans and governmental participation all have their effect on 
hospital usage 

The administrator has a continuing problem in keeping track 
of people including physicians, nurses, dietitians, pharmacists, 
laboratory techmcians, housekeeping peisonnel, carpenters, plumb- 
20 
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ers and electricians as well as inventories of expendable and 
permanent equipment, medical suppli«, food, drugs, blood, fluids, 
linens, uniforms and other items, all of which require a considerable 
amount of record keeping, planning, scheduling and regulation 

The nursing supervision of a lar^c hospital is a multifactonal 
scheduling problem which could probably be programed or 
investigated within a computer as a management game There 
are many types of special nursing areas and many different types 
of traimng and experience represented in the nursing staff plus 
hours, illness, vacations, emergency intense care situations, etc , 
etc , etc All of this must mesh without mess 

Architectural changes in the plant itself are continually required 
and can be planned for to the extent that trends are apparent 
Consider, for example, the effect on hospital usage that came about 
during the poliomyelitis epidemic of a few years ago, the facilities 
that ultimately were provided and the more recent diversion of 
these facilities to other purposes Consider what the possible effect 
may be of gastric freezing if it turns out to be an effective treat¬ 
ment for peptic ulcer What changes would occur in hospitals 
throughout the country if a major breakthrough came in cancer 
therapy’ 

When new services are added in one department this often 
influences the traffic of patients and demand for room in other 
areas Questions anse as to whether subsidiary satellite laboratoncs 
or X-ray machines or fluoroscopy units should be installed, or 
whether perhaps whole departments should be moved These are 
but a few of the problems or types of problems that the hospital 
admimstrator is faced with and which require the collection of 
large amounts of data and many man hours of tedious analysis 
if it IS all done by hand 

The hospital administrator also has obligations to the actual 
care of the patient It is obvious that efficient business procedures 
and billing must be provided so that patients know what they are 
billed for In the hospital the patient deserves an optimum oppor¬ 
tunity to find his way through the out-patient department, the 
\ anous consultations and special studies, a minimum of wait for 
admission and reasonable assurance that when major operative 
procedures or other treatments are scheduled that they wll not 
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be canceled for ^vant of the right blood type or because of an 
inadequate supply of operating room nurses 

There js much that the patient does not know about, winch also 
affects the efficiency of his care related to the flow of information 
from biochemical, hematological, bacteriological and other labora¬ 
tories, from x-ray, from consultations between physicians, from 
the electrocardiographic room All of this information flow lies 
in the province of hospital administration and is susceptible to 
modem automatic data processing techniques 

The administrator also has a problem in running the record 
room and in tending to the coding of diagnoses, keeping track of 
operations, complications, deaths, autopsy records, admission and 
discharge data, records of internal transfers of patients between 
services, information about patients’ home addresses and relatives 
as well as records of current liospital location of the patient and 
the physician m charge, data required for insurance forms and 
the summary reports to be used m administrative offices and by 
the vanous services to fulfill the requirements for hospital accredi¬ 
tation and approval of training programs 

The administrator would like to have a proven packaged system 
of known cost to sob e all of these problems He u ould like a 
hospital electronic brain which, like a nervous system, would be 
large enough and efficient enough to co-ordinate all of the com¬ 
plex data gathering parts with the neewsary means of appropnate 
presentation of information at stations throughout the hospital 
according to needs All of this requires a computer which should 
be attached to random access storage on the one hand and to 
larger computers to which it could spill some of the workload 
when needed Such an automated patient data system is theo¬ 
retically possible today as far as the haidware, software and some 
peripheral equipment are concerned It needs electronic engineers, 
programers, operators and systems designers together with properly 
informed and motivated physicians, hospital administrators and 
medical research workers in order to make optimum use of such 
equipment with a consequent savmg in cost to the patient, increased 
efficiency in patient care and use of the collected data for study of 
diseases and epidemiology 

Such a data center would be available also for basic research 
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and in education and could be used not only for the day-to-day 
feedback of information but also for special training of medical 
students and post-graduate students Nettsorks of tea*ching ma¬ 
chines could be attached to provide individualized programed 
instruction This would obviously be a complete solution to the 
problem presented above which the hospital administrator faces 

Such a packaged system does not presently exist However, such 
a system is more than a dream and is being worked upon from 
many different directions There are some\vhat similar problems 
in business that have been solved by computer analysis The large 
computer manufacturers and special organizations supplying con¬ 
sultation services have programs for solving scheduling problems 
and the problems of the best pathway through a complicated 
net\%ork ivith the least manpower and cost in material and with the 
greatest probability of reaching the desired destination Large 
businesses have played games \vith their business simulated on a 
computer to see the effect of changing procedures, offering new 
services, and changing the sizes and location of various components 
in the system These are not unlike the problems outlined for a 
hospital There are also similanties bettveen the floiv of traffic in 
a hospital and regulation of air traffic or the communication sys¬ 
tems for national defense 

It should not be surprising, therefore, to Jeam that the System 
Development Corporation is working on a patient data, computer 
based, system and that they have simulated a hospital within a 
computer using data obtained from actual patient records, pro¬ 
graming the computer to carry out the vanous functions which 
ivould exist m a real hospital At the present time general input 
and output devices are used such as are found m standard com¬ 
puter installations 

The. iilxvro^te. oC Oat cal 

lection and display of data will depend upon the results of studies 
being earned out in other places It is possible that the physician 
would view a text stored in the central system through a penph- 
erally located cathode ray tube and that the display might be a 
pnnted text or digital or graphic display depending upon the 
information requested 

For study of administrative problems in the simulated hospital 
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SIX wards of ten beds each, a phannacy and two laboratoncs were 
established in the computer For the study of problems related to 
patient care three of the wards were provided with a great deal 
of information from actual charts, a primary diagnosis and up to 
three secondary diagnoses All of the necessary data were prepared 
pnor to the experiment and placed on punch cards so that the 
information could be entered into the computer As the game was 
played additional information and requests were entered into the 
computer with the typewntcr and the output came as typed or 
printed information 

Much of the experiment was centered around the physician’s 
order By having the physician’s order immediately stored in the 
computer and all foreseeable consequences of this order of a clencal 
nature automatically taken care of, the nurses, pharmacists, 
business office and other involved personnel would have no 
further clencal work relative to the physician’s order It is part 
of the game to have information reported cither by hospital per¬ 
sonnel or by monitors or both relative to vital signs according to 
the requests of the physician in charge The computer automatically 
would receive and maintain this data in its appropnate form and 
would pnnt out on request these r^ords for any given patient 
A certain amount of necessary feedback u arranged for in the 
program so that the nurse not only knoivs when an order has been 
made by a physician but knows that the phannacy has received 
the order Also when the nurses arc remmded that it is time to 
give a drug they report back to the computer when the drug has 
been given The computer program also keeps a running inventory 
of all supplies in the hospital and at predetermined levels of 
depletion, drugs and other supplies arc ordered 

When the physician orders a laboratory test the results arc 
automatically consolidated into the pauent’s record in the com¬ 
puter The nurse is provided with a schedule of all the expected 
blood letting for that morning’s laboratory tests The laboratory 
is kept aware of the expected workload For those laboratory tests 
requiring special feeding arrangements the computer notifies both 
the nursing station and the diet kitchen Even the discharge sum¬ 
mary IS automatically taken care of This then is the simulation 
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approach and a computer is being used to seek the computer 
based solution 

Another approach to findmg solutions for the many problems 
of a complete data processing system is to set up regional medical 
centers and have them go ahead with equipment now available, 
work out techniques, integrate new equipment as it becomes 
available and actually seek information as to what is needed in 
both hardware and programs The first of these has been set up 
at Tulane University Medical School They are working out 
methods using special machine mark sense cards which can be 
marked by the physician, nurse or other person accumulating data 
for the patient’s record These cards are then run through a 
reproducer which converts them to punch cards and the data 
can then be used in a routine manner with a card reader or con¬ 
verted to magnetic tape or disk storage They are working out 
means of recording all types of data that is present on a patient’s 
chart including the history, physical examination, x-ray examina¬ 
tion, laboratory tests and diagnosis codes A wide variety of com¬ 
puter applications are under study and the principal objective 
of this study is to seek means of application of automatic data 
processing to the types of data storage and retrieval that would 
be of greatest interest to the general practitioner and specialists 
who spend much of their time m diagnosis, treatment and routine 
patient care There are other large centers where special appli¬ 
cations are under study in relationship to neuropsychiatnc, neuro¬ 
surgical and neurophysiologic problems 

There is a study in Baltimore of central processing of accounting 
for some six hospitals which, if successful, may provide a non¬ 
profit data processing service bureau for all of the hospitals m 
the state There are. many university medical school computer 
facilities set up fay departments within the medical school or in 
CO operation with electrical engineering departments or the uni¬ 
versity at large 

At the State University of Iowa Hospital patient billing appli¬ 
cations have been a relatively recent exploitation of facilities that 
were originally started m the 1920’s for university and hospital 
general accounting with card-sorting machines The University 
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Statistical Service Unit has handled all of the university general 
accounting and registration for some 12,000 students plus a great 
deal of statistical work for nearly all of the departments and 
schools in the university Recently the card sorting machines have 
been supplemented with a magnetic tape computer 

In anticipation of this addition members of hospital admmi 
stration, Statistical Service and the computer company spent some 
18 months meeting together, defining all of the most detailed 
aspects of the problem eind working out programs for carrying 
out hospital patient billing procedures A master tape was pre 
pared which contains the registration biographical data and hos 
pital numbers of some 500,000 patients treated at the University 
Hospital since its inception in 1927 Every out patient is now 
included in this identification system as well as all of the patients 
admitted to the hospital All hospital charges are put on punch 
cards and entered into a computer program which places them 
on the appropriate patient account on a magnetic tape Eacli 
m patient account is updated and printed out for business office 
use on each of the five work days during the week At the time of 
an m patient or out patient discharge, the discharged patient 
account is assigned an index number inmally set at (9) and re 
duced by (1) on the occasion of each subsequent daily updating 
At the end of nine ivorbng days after discharge, the account 
having accumulated all “late” charges, is automatically printed 
out and made available to the hospital business office for patient 
billing or insurance submission, as the case may be 
As bills are paid receipt information is also added on punch 
cards and when a bill is paid in full the computer pnnts out a 
statement to that effect and the account is removed from the tape 
The Iowa Medical Service Plan which acts as an accounting and 
collecting agent for private staff ph^^ician charges also uses a 
similar system In addition over the years many special medical 
studies requiring data sorting and simple statistical analysis have 
been carried out m the Statistical Service 

In 1958, a separate University Computer Center was also 
established Originally this was a small open shop center but it 
has grown to a larger closed shop center with operators, programers 
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and higher level personnel to help individuals from the medical 
school and the entire university who have computational prob¬ 
lems that they wish to run 

Much of the support of this center has come from its two largest 
users the Io\va Testing Program under Lindquist and the physics 
department satellite research under VanAIlen Coincident ivith 
the growth of the Statistical Service and the University Computer 
Center there have also appeared man> small computers about 
the campus of both analog and digital type Many of the analog 
computers are located in the medical school Some of these are 
so special in purpose that perhaps they should be classified m a 
category with standard electrocardiographic machines, which 
though analog computers, are committed to one narrow t>pe of 
service function The Volemetron is an example of a commercially 
produced portable analog mstniment programed for determining 
blood volumes ivith radioactive human serum albumin or with 
radioactive chromium 51 tagged red cells Most of these analog 
computers have been obtained for special research projects and 
some have been built m the school’s section of Medical Electronics 

It IS inevitable that with the growth of data processing in a 
medical school and m a university that some users will desire 
equipment much larger than they alone can afford and that there 
Will be continual pressure to increase the potential of the central 
computmg center At the same Umc individual users will be faced 
With the decision as to whether tliey would prefer to do their 
computing in the large center and pool resources with other users 
or seek the advantages of having a smaller computer in the im¬ 
mediate area of work 

There is also a continuing problem of personnel Computer 
science is a new interdisciplmaiy field drawing upon mathematics, 
electrical engineering, psychology and speaalists from almost an> 
field in which an individual becomes sufficiently interested in com¬ 
puter science to wish to devote his major effort in this area As 
these individuals become more mvolved in computer science they 
may be looked upon by their onginal specialty area as somewhat 
contaminated rather than pure academicians and scientists No 
apologies need to be made for computer science Computer facil- 
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ities have risen out of a need and are looked upon too much as 
service centers, but there is a great deal of basic investigation 
involved m solution of the many problems These centers should 
be looked upon as a focal point of problem solving, theory and 
research and the problems posed are as vaned as the nature of 
man Man’s greatest challenge ultimately is in generalizing prob¬ 
lem solving and the best problem solvers amongst us should be 
found and encouraged to work with or in such centers There will 
inevitably need to be large central computing facilities available 
in universities as well as smaller computers close to the areas of 
heavy routine workload such as in a college of medicine Probably 
the ultimate solution will be peripheral satellite equipment which 
can be used for smaller routine problems and a large central 
facility tvhich can be shared with other users by special program¬ 
ing and coaxial cable 

The administrator in nonuniversity hospitals may have a simpler 
problem in deciding about equipment and computer applications 
since the immediate applications will be recognized as those of 
business procedures for which there is excellent and fairly reason¬ 
able equipment available Hospital accounting programs have been 
worked out and would need only to be modified for local adapta¬ 
tion There is now available random access equipment for storage 
of data which would make it possible to expand the business 
procedures to include the handling of laboratory data and some 
of the data >vhich might appear on a good discharge summary 
for each patient If the basic equipment for business procedures 
can be justified, the same equipment is then also available for 
clinical research It cannot be overemphasized that expert advice 
and extensive co-operative planning of objectives, equipment and 
program must precede equipment acquisition to succeed 

Perhaps the easiest and most certain way of obtaining immediate 
benefits of modem data processing techmqucs is to participate m 
national studies of defined scope such as those of the paranatal 
study or a broader based study such as PAS One of the best 
nation-wide studies thus far is that of the Commission on Pro¬ 
fessional Hospital Activities, Incorporated which %vas established 
in 1955 as a nonprofit corporation to take over the Professional 
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Activities Study (PAS) As of January 1, 1962 the staff included 
seventy five persons variously trained m medicine, biostatistics, 
data processing, nosology, records and admimstration This group 
IS conducting research on hospital clinical data needs and methods 
of collecting and handling such data Their files contain over six 
million chnical record abstracts Analyses arc made for reporting 
back to the participating hospitals and a number of special studies 
have been conducted relative to morbidity, mortality and medical 
and hospital practices 

The second major program in this study is the Medical Audit 
Program (MAP) which now is being used in thirty eight hospitals 
with a discharging rate of approximately 400 thousand patients 
per year This latter program allows the medical staffs of the 
participating hospitals to evaluate the quality of the medical care 
being provided and to look for special problems 

The PAS abstracting service is subscnbed to by some 230 
hospitals in thirty two states, Washington, D C, Puerto Rico and 
three of the Canadian provinces Growth is continuing at an 
accelerating rate %vith over fifty hospitals added per year now 
compared with about twenty five only five years ago By January 
1963, over ten per cent of ciurent patients discharged from general 
hospitals in the United States were included m this sample 
The entire study is planned around case abstracts Case abstracts 
are produced by nonprofessionaJ but specially instructed clerical 
help in each participating hospital 

A single case abstract takes about five minutes to prepare These 
are sent to the central office for processing and a charge of thirty 
cents IS made to the hospital for each abstract submitted This 
charge provides about 80 per cent of the cost of the entire project 
The additional expense has been made up from vanous grants 
and monies from research projects For the participatmg hospital, 
this plan provides an automatic means of evaluating the quality 
of care ivithout increasing the workload of the physicians 

A physician, Virgil N Slee, has been the director during the 
first decade of the study and continues to guide the service He 
stresses certain important points which could i^ell be adopted in 
other large studies and might serve to answer some of the strongest 
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criticisms of 2.utom3.x.\c data procc^ing in this general area Data 
processing means data reduction and the development of statistics, 
so that the important individual may be lost from view In the 
PAS study, however, the patient is retained as not only a funda¬ 
mental unit of the study, but if he is unusual m any observed 
respects, this information is earned through to the final reports 
so that the interested physiaan, investigator, or medical audit 
committee may make use of this most valuable bit of information 

This does not mean that statistical studies are not also carried 
out The study is well designed and monitored for uniformity and 
accuracy of data A surprisingly lai^e amount of information is 
obtained for use in study of diseases, their incidence, treatment and 
complications plus data which is useful for study of hospital utiliza 
tion and local hospital practices Over the years the permanent 
record of patient abstracts is added to so that studies can be made 
of trends which should allow prediction of new hospital needs 
and areas where economies could be introduced as certain types 
of activity are eliminated 

There are a vanety of monthly and semi-annual reports which 
are automatically sent back to each participating hospital for use 
of individual physicians, department heads, hospital committees 
and administrators 

Such a study should be a sumulaung and beneficial influence 
on medical practice m general The legal yardstick for quality 
of medical care has, until now, been the current practice in a 
particular community But, hospitals which elect to participate 
in a nation-wide survey of this sort arc, in a way, declanng that 
they belong to a nation-wide community and that they wish to 
compare certain aspects of their medical practice ivith the national 
average Comparisons between hospitals have been made as to 
the distribution of length of stay for patients according to selected 
admission diagnoses Thus, certain ho^itals and certain physicians 
have been identified in a confidential way If a physician is ex¬ 
travagant in admitung many insured patients simply for diagnostic 
purposes, this becomes evident If excessive hospitalization is used 
for patients with tonsil and adenoid operations or some other 
minor ailment or operative procedure, it is evident m the auto 
matic reports 
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All reports are coded m such a way that a phy^sician or a hospital 
remains a statistic only except to the people directly involved in 
that hospital Of course, if a group of physicians wish to publish 
the results of their participation in this study, they may do so 
One hospital in which the laboratory hemoglobin values were 
abnormally low studied the reasons The effect of correction of 
certain aspects of the laboratory procedure resulted in a sig¬ 
nificant nse in the hemoglobin levels being reported Along with 
this change, a study was made of the number of blood transfusions 
being used in the hospital and there was a very remarkable saving 
in numbers of transfusions This, of course, resulted in an economy 
m time, space and personnel and also must be recognized as a 
worthwhile reduction in nsk to the patients, since many patients 
must have been receiving blood transfusions who did not really 
need them 

Tliere are interesting ways of checking poor medical practices 
such as, for example, observing the number of patients discharged 
with sugar reported in the admission urine analysis, but tvith 
neither a blood sugar analysis nor a coded diagnosis of diabetes 
Studies have been made of the use of antibiotics m different hos¬ 
pitals, and over a number of years, together with a study of the 
relationship of antibiotic use to the use of the bacteriology labora¬ 
tory Review of data from seventy six hospitals comparing the ratio 
of antibiotograms to antibiotic usage in 1959 and 1961, for ex¬ 
ample, showed a 31 3 per cent over-all increase in this use of the 
laboratory In 1961, some 26 4 per cent of patients given anti¬ 
biotics were also provided wth specific sensitivity studies For 
hospitals with under 5000 dischai^es per year, the ratio in 1961 
was 0104 The highest ratio, 0 310, was observed in hospitals 
classified as medium large with 10,000 to 14,999 discharges per year 

Hospital staff sponsored quality control is generally accepted 
as important to the maintenance of a grade A hospital To do 
this effectively would require an unrealistic amount of physician 
time without setting up certain rules and limits for automatic 
screening of the bulk of records Automatic data processing per¬ 
mits common base problems to be assembled so that they can be 
recogmzed Patterns show up in certain areas and become self- 
evident as a result of the vanous types of reports and the per- 
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mutations and recombinations of data that have been used to 
bnng out such problems if they do, in fact, exist 

Although the primary purpose of PAS is to provide information 
of immediate use to the partiapating physicians and hospitals, 
the abstracted information is pcrmanendy stored and available 
for large scale statistical reviews which should be invaluable in 
epidemiological studies on a nation-wide scale It is a study which 
the smallest hospital may participate in without any investment 
in equipment and without duplicating posiuons for any or all of 
the highly trained personnel required to develop and run such a 
program 

Such a remote abstracting service, of course, does not solve 
the day-to-day data processing problems, many of which need to 
be on-line, and tvhich the ultimate and optimum system should 
include There is also a great likelihood that larger hospitals which 
have some access to computer equipment may mistakenly feel that 
an external program such as PAS is an unnecessary duplication 
of what could be carried on locally One total solution cannot 
at present be applied to all medical data processing problems 
Some problems require a small computer installed on the spot, 
other problems can be best handled m a larger computer center 
within an institution, and still other problems can most econom¬ 
ically be earned out by some larger outside agency 

With present-day competition for time, money and equipment, 
it may seem illogical to pay an outside agency to do compuUng 
when a large computer is available m the institution, but it ivould 
seem to be much more economical to use PAS or other outside 
services and to reserve the use of local computer facilities for other 
studies which cannot be as efficiently earned on outside of the 
institution Information can be abstracted from their national 
storehouse of data by PAS and sent to participating institutions 
in the form of reels of magnetic tape so that any desired special 
statistical studies can be carried out with use of local computing 
equipment 

Another important advantage of PAS is the institution of an 
optimum coding system which is standardized and acceptable to 
the majonty of physicians There is a great tendency for hospitals 
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and departments to destroy the effectiveness of a coding system 
by splitting off parts of it for redefinition This is always done 
in order to make the system better for a specific and immediate 
interest, but these interests chai^ and soon the system is so altered 
that case histones cannot be recovered by diagnosis No system 
can be a good general system and at the same time very specific 
m narrow areas of interest When there js continual disagreement 
between physicians or departments about the coding it is often 
easiest to get outside help Then those who dislike the system 
because it won’t do their research for them can blame the outsider 
instead of each other A good general coding system used by many 
hospitals must be used and maintained m a pure state If special 
studies require additional coding this is casj to arrange but it 
must be an addition to and not an alteration of the standard code 
Another example of a study of national scope is the American 
Medical Research Foundation Paranatal Study which is a data 
collection program designed to improve the medical care given to 
mothers and babies Again under the direction of a physician, 
Sydney S Kane, this program has gradually evolved to a system 
which tabulates information from 145 hospitals in forty-three dif¬ 
ferent states, District of Columbia and Puerto Rico The data is 
collected at one central headquarters, computer processed and 
then distnbuted back to the individual hospitals for their further 
analysis The data is sent back m three different forms One set 
of tables represents the individual hospital Another set concerns 
the regional area in which the hospital is located A third set of 
tabulations makes comparisons for the nation as a ivhole Tins 
program is also built around abstracts of individual patient records 
Although the original study was d«igned to improve neonatal 
and maternal care, there has resulted some interesting information 
of use to hospital administration and planning It is current prac¬ 
tice in some areas in the country to keep both mother and child 
in the hospital for a much longer penod of time If further studies 
reveal that no difference exists m maternal or infant mortality or 
morbidity by virtue of the longer stay, then a great saving in 
hospital bed utilization might be made 

Observations also have been made which indicate an aging of 
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the obstetrical community The progressive rise in multiparous 
mothers and a decrease in pnmiparous mothers indicates a long* 
range decrease in the requirement for maternal beds This, of 
course, is probably balanced by the increasing demand for geriatric 
facilities, but the trend generally and the trend in specific hospitals 
can be identified and could be used in planning of the requirements 
for the total hospital need and the most efficient distribution of 
facilities within a hospital 

The American Medical Association, under the stimulus of this 
type of study, has, through its committees, looked into the legal 
aspects of this type of data collection A number of states have 
adopted legislation, but the Nebraska Act is cited by the Com¬ 
mittee on Maternal and Child Care of the Council on Medical 
Service of the Amcncan Medical Association as an example which 
other states might follow in enacting such legislation as it is needed 
Tlie Nebraska statute protects the confidential patient-doctor 
relationship but makes possible the reporting of information about 
patients as long as it is kept confidential (codified) so that com¬ 
mittee work in hospitals, health surveys and other scientific investi¬ 
gations can be earned out The Legislative Bill 326 from the 
Seventy-second session of the legislature of Nebraska is considered 
a model bill * 

There is increasing interest in operations research or study of 
hospital costs and planning of ways m which automation might 
reduce expenses and increase the efficiency and productivity of a 
hospital Reviews of some of these studies have been made by 
riagle and also by Blumbcrg Flaglc has discussed the decision 
processes involved in balancing demand and cost of available 
service in any given area Tins requires knowledge of linear pro¬ 
gramming and stochastic theory if it is to be used to its full ad¬ 
vantage in solving hospital administration problems Several 
approaches may and probably should be used and physicians, 
administrators, operations analysts and many others must con¬ 
verge in the study of local hospital practices and solutions found 
in other similar institutions 


•Bill at end of chapter 
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LEGISLATURE OF NEBRASKA 


SEVENTY-SECOND SESSION 


Legislative Bill 326 


FINAL READING 


Introduced by Fern Hubbard Orme, 20th Distnct 
Read first tune January 25,1961 

Referred to comnuttee on Public Health and Miscellaneous Subjects 

Sent to printer January 26, 1961 

Final form sent to printer Apnl 17,1961 


A BILL 

FOR AN ACT relating to public health and welfare, to provide 
for Confidential information received by the State 
Board of Health, Nebraska State Medical Assoaation, 
allied medical soaeties on in hospital staff com* 
mittees in the course of a medical study for the purpose 
of reducing morfjidtty or mortality, to pro* 
vide that such information and maten^ so 
furnished may be used only for the purpose of 
advancing medical research and medic^ educa¬ 
tion, to provide for general publication of a 
summary of said studies, to provide an exemp¬ 
tion from legal liability for those furnish¬ 
ing such information and for those studymg 
and publishing the results and summaries of 
such studies, and to provide that such mate¬ 
rial and information and any findings or con¬ 
clusions of such authorized groups shall be 
privileged commumcations whidi may not be 
used or offered or received in evidence m 
any legal proceeding 

Be it enacted bji the people of the State of Pfebraska, 
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Section 1 Any person, hospital, sanitonum, 

2 nursing or rest home or other oi^anization may provide 

3 information, interviews, reports, statements, memoranda, 

4 or other data relating to the condition and treatment of 

5 any person to the State Board of Health, the Nebraska State 

6 Medical Association or an> of its allied medical societies 

7 or any in-hospital staff committee, to be used in the 

8 course of any study for the purpose of reducing morbidity 

9 or mortshty, and no liability of any kind or character 

10 for damages or other relief shall anse or be enforced 

11 against any person or oigamzation by reason of having 

12 provided such information or material, or by reason of 

13 having released or published the findings and conclusions 

14 of such groups to advance medical research and medical 

15 education, or by reason of having released or published 

16 generally a summary of such studies 

Sec 2 The State Board of Health, the Nebraska 

2 State Medical Association or any of its allied medical 

3 societies or any in hospital staff committee shall use or 

4 publish said matenal only for the purpose of advancing medi- 

5 cal research or medical education in the interest of reducing 

6 morbidity or mortality, except that a summary of such studies 

7 may be released b> any such group for general publication 

8 In all events the identity of any person whose condition or 

9 treatment has been studied shall be confidential and shall 
10 not be revealed under any circumstances 

Sec 3 All information, interviews, reports, state 

2 ments, memoranda, or other data furnished by reason of this 

3 act and any findings or conclusions resulting from such studies 

4 arc declared to be privileged communications ivhich may not 

5 be used or offered or recaved in evidence m any legal 

6 proceeding of any kind or character, and any attempt to use 

7 or offer any such information, interviews, reports, state- 

8 ments, memoranda or other data, findings or conclusions or 

9 any part thereof, unless waived by the interested parties, 

10 shall constitute prgudical error resulting in a mistrial 

11 in any such proceeding 
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W HEN physicians are first confronted with the idea of a com¬ 
puter m some aspect of clintcat practice, they often focus their 
attention on the use in differential diagnosis This is an aspect 
of potential computer application about which physicians seem 
to develop ather enthusiasm or strenuous opposition One opimon 
IS that the vast m&jonty of patients have problems that are so 
simple to diagnose as to obviate computer use and that those 
patients ivho have difficult diagnostic problems ^vl}l demand careful 
obser\ ation by the experienced physician in order to obtain data 
of diagnostic importance and that the important information is 
from a type of extrasensory perception that can neither be de¬ 
fined or communicated to the inexperienced let alone to a cold 
machine 

Some physicians are in the habit of consulting books listmg 
signs and symptoms and related diseases when they have an 
especially difficult diagnostic problem Physicians attending clin¬ 
ical pathological conferences, especially if they must discuss the 
differential diagnosis and the logic of reaching an opimon as to 
the disease process and the complications most likely present in 
the patient, are ell aware of the mass of literature and experience 
that IS available for review, even when the problem has been 
narroived down by the limited signs and symptoms of one patient 
There must be a sigmficant number of patients whose diagnoses 
could be assisted by automatic data processmg Diagnosis need 
not be limited to finding a single disease but should include a 
detailed appraisal of possible subsidiary conditions, %veaknesses 
and potentials for disease Physicians need to know all they can 
leam about their pauents and machines do help in gathenng and 
arranging data for more cfiicient use Vannevar Bush, in 1953, 
37 



38 


Computer Appheahans in Medicine 


observed that too much was required of memory and not enough 
time was left for synthesizing thought He suggested that elec¬ 
tronic means be exploited to take over the data storage and 
retrieval areas m medicine 

Nash, 1954, observed how inadequate a textbook is for making 
the many comparisons required in differential diagnosis He ex¬ 
perimented with a special ruler with about 300 diagnoses listed 
on the side and multiple inserts each labeled as a sign or symptom 
and with lines marked opposite diseases when the sign or symptom 
was present in that disease The diagnostician would use such an 
aid by selecting those inserts corresponding to the observed positive 
signs and symptoms and when some disease was found with the 
lines across all or most of the inserts a diagnosis could then be 
considered likely 

Such a comparator i$ illustrative and instructive It is a land 
mark in a shift of thinking from books on differential diagnosis 
to automatic data processing and storage equipment for the busy 
diagnostician, but all such methods arc pnmanly aids and will 
still depend on the experienced physician whose senses are trained 
to collect the most perunent data with great skill and some ac¬ 
curacy Physicians freed to observe and converse with their 
patients would feel that their time was used in a more productive 
way If they recognized a disease at a glance no source material 
would be needed When the diagnosis is difficult the accumulated 
knowledge should be more available and in a quantitative and 
more certain way 

In 1957, Lipkin and Hardy reported a study of the data charac- 
teristjcs ^f hematologic diseases as coded for application to mar¬ 
ginal punch cards The data included information from the case 
history, physical exammauon, and laboratory examinations Stand 
ard textbooks on hematology were consulted during the formula¬ 
tion of the problem and master cards were prepared characteristic 
qC each, bnmatxiln^ disease Considecahle pretimmary planmng, 
and preparation was carried out with marginal McBee punch 
cards, and the results of these studies have been published More 
recently, Lipkin and fellow worken have transferred this problem 
to a digital computer 

They use three magnetic tapes Tape A has the case history 
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data from each patient, and the first step is to compare this matenal 
with Tape B which contains definitions of classes of disease T^venty 
diseases were chosen for study and these were grouped into nme 
different general classes The first class, for example, contains the 
diagnoses pemiaous anemia and tropical sprue Tape C contains 
definitions of each disease within a class and after the comparison 
has been made between Tapie A and Tape B, then the appro- 
pnately suspicious classes are selected from Tape G for further 
comparison wth the patient and further differentiation of the 
diagnosis from class to specific disease results 
The program is designed to make a positive diagnosis when all 
of the cntical items of a disease, as defined m the Tape C, are 
satisfied by a given patient’s data If some cntical item has been 
tested and found absent, the disease is still retained in the dif¬ 
ferential diagnosis and the appropnate information is prmted out 
suggestmg that the physician rebeck this item If two or more 
cntical items are found to be absent, the disease is discarded, but 
the pnntout explains ^vhat information was absent If one or 
several cntical items have not been tested, then the program lists 
information which is required before the probability of a diagnosis 
can be stated A weightmg techmquc is used for the signs and 
symptoms and laboratory findings m the disease definitions and 
important negative findings arc also weighted The construction 
of disease classes serves to reduce computer processmg time by 
an estimated 50 per cent 

In this particular study, the diseases are fairly well defined, 
but, if such a program were expanded to include other conditions, 
a problem might arise due to the vagueness in the defimtion of 
certain diseases As this program is given a more thorough chmeal 
trial. It will be possible to alter the weighting factors by appropnate 
statistics! tcchntqiscs sad t<j add additiim^ diseases sad dasses 
of diseases as it becomes evident that these are required 

Lipkm has indicated that it is desirable to list the diseases in 
the differential diagnosis m sequence of decreasing probability 
He also emphasizes the possible usefulness of a program of this 
sort as an aid in teaching differential diagnosis and that its use 
m medical practice might contnbute to an increasingly analytical 
(less intuitive) basis for diagnosis If the computer were used to 
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store large amounts of factual infonmatton which now a physician 
must gather from articles and textbooks or from his own memory 
for each diagnostic problem, then perhaps the physician’s time 
could be concentrated more in the area of onginal thought and 
less in the area of a filing clerk 
Warner. Toront o. Veasey, and Stephenson prepared a matrix 
of probabilities based on a'study on,035TSatlenis^Uirsuspected 
congenital heart disease who were referred to their laboratory for 
diagnostic catheterization and other special studies They listed 
fifty symptoms and signs and thirty-three diagnoses The incidence 
of each symptom in each disease was transferred to punch cards 
and stored in the computer along with the program for carrying 
out the analysis For each subsequent diagnostic problem patient 
analyzed, the symptoms from a check list were punched on a 
card which was run through the computer A conditional prob¬ 
ability equation was then worked out automatically with the 
appropriate probabilities from the table and finally the diseases 
with probabilities greater than one per cent were printed out 
together with their respective probabilities 
Thirty six cases were subjected to this program The “true" 
diagnosis was made by physiologic studies and observations or at 
the time of operation The material was also submitted to three 
experienced cardiologists The result was that the computer diag¬ 
nosis was m agreement wth that of the cardiologists The dif¬ 
ferential diagnosis from the equation was frequently more com 
plete and appeared more logical m retrospect to the clinicians 
than the differential diagnosis listed by each of them prior to 
seeing the equation’s prediction These workers now plan to use 
the computer to recalculate the probabilities periodically as ad¬ 
ditional data accumulates in the further use of the program In 
a sense, then, the program ^vlll “learn” from experience by virtue 
of refinements of the table based on the accumulating data It is 
also possible to study the effect of modifying the symptom set 
for a given patient These workers report an enthusiastic accept¬ 
ance of use of this technique in the teaching of medical students 
The formulation of a diagnostic problem m this framework 
requires a very exact definition of each symptom and each disease 
It requires an understanding of which symptoms are exclusive 
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and which ones are interdependent The method requires that 
only one sign or symptom be used from several having the same 
information content The important determinants of the diagnosis 
are emphasized and symptoms which have little information con¬ 
tent can be recognized and eliminated A student or physician 
testing his collections of data in such a study should develop a 
more acute sense of the relative value of the data that he collects 
and should become more effiaent in his histoiy-takmg and physical 
examination He should learn to ferret out those items which have 
the greatest information content 

So far the approaches discussed relative to differential diag¬ 
nosis have followed a pattern of comparing diseases with signs 
and symptoms The refinements have consisted primarily of refine¬ 
ments m data handling and m the use of probability statistics 
These studies demonstrate what a tremendous amount of infor¬ 
mation must be collected and stored m tables even for a very 
narrow area of medical practice There is no question but that 
programs can be developed by this means if enough interested 
and highly trained physicians throughout the country will con¬ 
centrate on defining diseases and their related signs and symptoms, 
and wiU collect the necessary data for preparation of the reference 
matenal both from the literature and from further prospective 
protocol studies of patients There is the problem of differences 
m geographic effects and the influence of epidemics A matrix 
might continually change, less rapidly for certain broad types of 
diseases such as congemtal defects and more rapidly for infectious 
diseases It is common for physicians to make rather difficult 
diagnoses consistently and occasionally in error simply because 
there has been an outbreak of that particular disease m the area 
There may be ways of developing differential diagnosis pro¬ 
grams which do not require as much searching of unimportant 
data in order to focus on the most pertinent aspects of the problem 
Newall and Simon have su^eslcd a means of study of human 
EEmkmg wfuch'is somewhat anali^us to simulation experiments 
which will be menuoned m a later section Clynes, for example, 
constructed a computer with parts representing the various com¬ 
ponents of the respiratory-heart rate reflex mechanism and placed 
man and machme side by side Recordings of respiratory movement 
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were taken from the man, fed through the computer, and the 
computed heart rate was then compared with the patient’s actual 
heart rate Adjustments were made m the machine until it cor¬ 
responded with the human Whether or not the analog was a 
unique solution, the very construction of it and testing of it under 
a variety of conditions led to new ideas about the way the reflex 
system works This is mentioned here in some detail because it 
serves to introduce the suggestion of Newall and Simon They 
would make a somewhat similar comparison of problem solving 
by man and the problem solving analc^ called GPS (The General 
Problem-Solving Program) GPS has not specifically been de¬ 
signed for use in differential diagnosis It is a very general program 
It IS a possible analog of the way humans solve problems It is 
software which can be used with any moderate sized digital com¬ 
puter There is no assumption of similarity between either the 
brain and the GPS program or between the brain and the com¬ 
puter used Newall and Simon have suggested that we observe 
how GPS and a human subject solve the same problem Both the 
brain and computer programed with GPS are devices for manipu¬ 
lation of symbols and for problem solving behavior To watch man 
and programed machine work on the same problem would have 
the virtue of developing a specific diagnosis program useful in 
medicine while, at the same tune, exploring the way in which 
an experienced and capable physician functions in making his 
differential diagnoses 

The drawback to this approach is that a rapid easy solution 
applicable for early demonstration of clinical use would be un¬ 
likely A great deal of study of physicians’ logic and modification 
and amplification of the problem-solving program would prob¬ 
ably be required The large electronic storehouses of medical 
knowledge would still be needed, but perhaps they could be used 
more efficiently Perhaps we could find out to what extent the 
intuition of physicians is simulated by a probability model Cer¬ 
tainly physicians do not cany around m their heads large matnccs 
of correlation coefficients between all possible diseases and all 
known signs and symptoms Such a storehouse is, in fact, even 
beyond the capacity of any computer yet designed 

The human brain seems to be capable of developing very clever 
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ways of figunng out the relationships needed from a very limited 
amount of actual memonzed material Physicians think of diseases 
in a dynamic context as a sequence of changes occurring in cer¬ 
tain organs, and they remember the signs and symptoms because 
of their understanding of the mechanisms which bring about these 
indices of the hidden disease process They do not usuallj memonze 
long lists of signs or symptoms any more than we remember stories 
by memorizing a list of nouns, verbs, adverbs, etc There are 
many ways of classifying diseases on the basis of anatomy, physi¬ 
ology, bacteriology, patholc^, etc The clever diagnostician re¬ 
members many little relationships because of some common 
denominator which may be rather obscure on superficial examina¬ 
tion of the actual logic used at the bedside This is the expencnce 
of which physicians are proud This is nhat appears to be artistic 
ability in the practice of medicine and why it may seem to many 
physicians that they are gifted m making correct decisions without 
definable data and why they profess that rare diagnoses can be 
correctly made when no statistician would dare stick his neck out 
on the basis of the data presented 

There is, then, a large job to be done by the student of physician 
logic or of human brain problem-solving We must find out how 
the best diagnosticians function, what information they use and 
how they use it to make the astute diagnoses that they make There 
must be also, at the same time, review of the way in which phy¬ 
sicians make mistakes The preparation of programs by this 
method will be carried out in occasional spurts of brilliance but 
chiefly through jlong, hard, seemingly unrewarding labor Con¬ 
tributions will be made by individuals and groups of many interests 
and levels of training It will require considerable teamivork, but 
the resulting program will contain the fruits of all of this labor 
and better diagnoses should ultimately be possible with the pro¬ 
gramed cumulative bnlliance of all of the best available brain 
power How soon a workable medical GPS will be available 
depends on our interest and effort. 

Lusted ha s estimated on the basis of some preliminary work 
done in hematology that the necessary work required to examine 
the entire body of medical literature and to collect new data on 
the basis of carefully defined standards would be m the range of 
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500 to 600 man-years The rewards of such effort might be, 
according to Lusted, the opportunity of a physician to contact 
his regional center, supply the available data, and request an 
immediate differential diagnosis list of the most probable diseases 
together with information as to what additional examinations 
might be necessary to make a deBmtc diagnosis He would also 
be able to obtain the latest information on the treatment of the 
disease and its expected prognosis Lusted suggests that the final 
mathematical model of medical diagnosis may involve a sequence 
of a number of different approaches, including (1) logical analysis, 
(2) probabilistic analysis, and (3) value theory to assist in the 
[choice of a treatment plan 

It appears at the present time that the student of computer 
diagnostics must have a background in probability theory and 
that he might be helped by the logical algebra of George Boole 
or, according to Swisher and Agahdes, by the related symbolism 
of McCulloch-Vcnn and Karl Menger’s Functor algebra The 
symbolism finally used by any one individual worker depends as 
much upon the influences of past experience and education m 
logic as it does on die diagnostic problem at hand. It may also 
be influenced by the worker’s acquaintance wah computer tech¬ 
niques A computer technologist working with analog computers 
might prefer a symbolism which was more immediately transfer¬ 
able into diagrams of switching theory The digital computer 
expert might be more inclined to use conditional probability, 
multiple regression or factor analysis techniques The statistical 
approach of multiple regression would be helpful in setting up 
the a priori probabilities given a sufficiently large number of case 
studies and would then be applicable to the processing of an 
individual patient’s diagnostic problem tlirough the use of these 
probabilities The present state of the art indicates that computer 
diagnosis programs are practical and could be useful tools for the 
busy physician or for the physician confronted with difficult 
diagnostic decisions 

Before such programs become an actuality for many of the 
disease categories and for specific gei^aphic regions there must 
be a tremendous amount of woric both in development of programs 
and in the collection of large volumes of reference information 



Dtagnoas 


45 


regarding actual patients wth knmvn diseases of all kinds For 
certain data such as routine laboratory tests, the recording of 
data offers no problem There are problems in the collection of 
data from history and physical findings m a form \vhich can be 
conveniently stored and at the same time will be acceptable to 
the physician \vho must make the initial observation and recording 
There are some observations which a machine can make more 
accurately than can a physician 
There are also some observations that a physician cannot make 
which require a computer such as the electrocardiograph To the 
extent that the physical exanunation is automated, decisions wll 
need to be made as to the kinds and amounts of data which are 
to be recorded and a good part of the computer application will 
probably be on hne and dynanucally programed so as to retain 
only observations of diagnosuc significance and to avoid the 
storage of large amounts of normal data ^^hlch might conceivably 
contain discriminatory information but which the busy physician 
would never get around to analyzing 
Whatever approach is used physicians ^vl^ be made more rather 
than less necessary since they will need to make the important 
decisions, collect data that no machine can collect and contribute 
to the program development that determines the usefulness of 
computers The efficiency of any program will be related to the 
intensity with which it is used m the local situation since many 
ill defined ethnic, social, psychologic and epidemiologic factors 
may influence the pattern in which a standard set of questions 
are answered The conclusions reached and programed into an 
automatic interpretation of the Cornell Medical Index or the 
Minnesota Multiphasic Personality Inventory, for example, must 
be revle^ved in many different circumstances 

This is one of the areas of clinical research which has been 
shown to be feasible but which will need implementation and 
co-ordmation in numerous centers and probably in satellites of 
these centers Sets of questions have been formulated Simple 
acceptable methods of havmg these questions answered have been 
worked out Programs have been written for automatically proccs 
smg the tests and evaluating the results Validation of results 
remains and will always remain an important and exciting task 
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for physicians The physician must decide upon the final criteria 
in any given instance as to the accuracy of all analyses whether 
they are earned out by physicians with their unaided senses or 
through intermediate machines and laboratory workers The com¬ 
puter IS one more tool to help in the never ending struggle to know 
and to control our environment and the diseases that afilict us 

Brodman, van Woerkom, Erdmann and Goldstein reported m 
1959 on a study with the Cornell Medical Index (CMI) which 
would seem well worth while repeating in any center where 
physicians are interested m contributing to and becoming involved 
in the automation evolution that is beginning They collected from 
nearly 6000 consecutive male patients as an initial sample Some 
sixty diseases were then studied and the pattern or score on the 
CMI was computed for each disease Data were then collected 
from 2,745 consecutive patients beginning January 1, 1956 and 
on the basis of the previously computed probabilities of each 
complaint for each disease the new patients were assigned Jiic- 
Iihood indices for each of the sixty diseases studied A diagnosis 
was considered as made when the index reached a certain level 
The computer diagnosed 44 per cent of patients having the sixty 
diseases and identified the organ system diseases in 55 per cent 
When a random sample of 350 questionnaires were examined by 
both computer program and physician there was no statistically 
significant difference in the results except that physicians tend¬ 
ed to diagnose psychoneurosis more accurately than did the 
machine 

Swenson and others in 1962 have reported on their initial ex¬ 
perience With a personality inventory test which could be used 
alone or in combination with the Cornell Medical Index and 
might bolster the weakness of the latter in determining psycho- 
neurotic tendencies as well as pointing out major neuropsychiatric 
problems This copyrighted text which was developed by Hatha 
way and McKinley at the University of Minnesota and called 
the Minnesota Multiphasic Personality Inventory (MMPI) con¬ 
sists of 550 yes-no questions which were originally standardized 
on a group of normal persons visiting patients at the University 
of Minnesota Hospitals, persons m the employment of the Works 
Progress Admimstration, high school graduates who had not yet 
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entered college and patients m the University of Minnesota Hos¬ 
pitals who were under treatment for nonpsychiatnc illnesses 

Patients were then selected who demonstrated rather dear-cut 
psychiatnc problems which could be classified in one of eleven 
different groups, namely hypochondriasis, depression, hysteria, 
psychopathic deviate, masculinity femininity, paranoia, psychas- 
thenia, schizophrenia, hypomania and social mtro% ersion Grading 
the test consists of determining whether an individual patient 
ansivers the questions in a way consistent with the onginal normal 
group or with one of the psychiatric group catcgoncs There are 
some additional keys which have been developed to validate the 
“test taking attitude” of the individual and are said to be reason¬ 
ably effective in detecting low reading ability, malingenng, lying, 
confusion or defensiveness 

This test has for some tune been machme scored at the Uni¬ 
versity of Minnesota but the Mayo Clinic Study has added to 
this a program which automatically converts the score into those 
standard interpretative statements that are used by psychologists 
The following is an actual example “Tends to miniinize or smooth 
over faults m self, family and circumstance Somewhat tense and 
restless Considerable number of physical complaints Prominent 
concern with bodily functions Moderately depressed, worrying 
and pessimistic Conscientious, orderly and sclf-cntical Conforms 
reasonably to social codes and morals Has a combination of prac¬ 
tical and theoretical interests Respects opimons of others without 
undue sensitivity Normal female interest pattern for work, hobbies, 
etc Has capacity to maintain adequate social relationships ” 

It IS apparent that the latter portion of this example is describing 
a normal individual, but the mitial statements, indicatmg ab¬ 
normal patterns, could be helpful in preparing the way for a more 
rapid understanding between the patient and his physician The 
patients look upon the test as another effective means of com¬ 
munication which fills in the tune that otherivise might be “ivasted” 
while they are waiting for appointments with their physician or 
consultants or for other special laboratory tests and examinations 
The test is presented to the patient in the form of a small pad of 
cards and a special electrograpluc pencil and takes one to one-and- 
a-half hours to complete The cards are collected together twice 
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daily and processed and the reports are pnnted out on a chain 
printer on paper which can tlicn be inserted m the patient’s 
record Each record requires about thirty-five seconds for processing 

This approach of allowing the patient to take some of lus own 
history by the use of standardized questions and mark sense cards 
has been amplified in the Tulane study to include forms which 
arc made out by the physician, nurses, and technicians so that a 
great deal of the data which is usually found m the patient’s 
hospital chart can be machine processed, assembled and in a 
limited way analyzed There is no reason why many of the physical 
findings usually written in a physician’s record could not equally 
well be recorded by the use of a check list Information which is 
so specialized and unusual as to be unavailable in any standardized 
set of questions or check list could still be wirtten out and main¬ 
tained m that form or transferred to the machine record by tech¬ 
niques which arc quite similar to the use of an ordinary typewnter 
Any data that can be coded for machine processing and storage 
becomes available then for research m ilic field of diagnostics, 
the study of probabilities, factor analysis and stochastic processes 

In such an arrangement each individual patient contributes to 
the information content of (he whole organization This has always 
been true to the extent that individual patients contribute to the 
experience of the physicians canng for them, but this experience 
could become much more rigorous if added to accessible electronic 
storage and would no longer depend upon the fallible human 
memory TIic physician rn (he future wJJ have avaiJabJc such 
extensive and accurate information that he can devote more of 
his attention to nonroutinc decisions which no machine can 
accomplish 

In the beginning there is no obvious reason why a group of 
physicians m a small hospital without computer facilities could 
not participate m such a study as a “satellite” with very little 
local investment in equipment This would have the added ad¬ 
vantage for the central study of adding to the size of their sample 
as well as broadening the base from which the sample is obtained 

Automation may make possible*the earlier recognition of some 
medical problems through the use of mass screening Smith, Miller, 
Manenfcld, Hahncman and Willard reported in 1959 a study of 
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3,272 tape recordings of heart sounds m children in which a 
technician recorded approximately 250 children per day and a 
physician listener subsequently screened these at the rate of 140 
per hour Seven children were found with definite heart disease 
and only two of these were previously known Gerbarg and others 
have recently shown that a digital computer can be programmed 
to analyze phonocardiograms but they feel that more work needs 
to be done in recording techniques in order to make full use of 
a full-frequency spectrum and to minimize noise Wherever the 
job done is repetitive and where fatigue and emotion might have 
some effect on human effectiveness, machine applications can be 
considered The machine is able to remember, compare, compute 
and make decisions on the basis of predetermined criteria 



Chapter TV 

ELECTROENCEPHALOGRAPHY 
AND ELECTROCARDIOGRAPHY 

Computers arc forcing man to see more of the art in science, 
more of the forest along with the trees, the patterns within pat¬ 
terns, the difTcrcnces in viewpoints and interpretation and iJie 
difficulties of being really sure about any interpretation Current 
investigation of clectrocnccphalc^aphic tracings, electrocardio¬ 
grams, photoroentgenograms and other patterns will be discussed 
m an effort to emphasize the need for a pluralistic approach m 
analysis and the benefits to be derived from computer use in these 
endeavors 

Most of us for years have had an oscilloscope at home with 
winch we allow our children to play (TV) and each of us has a 
portable 10 watt computer that is priceless, infinitely complex and 
not too accurate in our respective cranial vaults Man carries with 
him in his five senses the penpheraj transducers and the network 
for obtaining data plus all of the servomechanisms required to 
keep the system working at a level that may provide usef^ul main¬ 
tenance, reproduction and occasionally some creative output 
Once a year a fc^v thousand of these complex living systems 
gather together in a large broadcasting studio m Des Moines, 
Iowa for an All-State Music Festival to listen to an orchestra of 
200, a band of 270 and a chorus of approximately 600 high school 
musicians It is interesting to sit in one of the scats in a far corner 
of the balcony and think about the complex data flow, analysis 
and “creation” that is avaibblc m such a spectacle and what a 
wealth of material the investigator in electroencephalography fias 
at his command No individual in the entire amphitheater receives 
exactly the same information None of the messages arc pure The 
50 
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hail acoustics, though good, arc far from perfect There are some 
baffles hanging from the ceiling which probably reduce some of 
the reverba-ations 

Sometimes the 1000 performers and the audience seem not 
unlik e a large model of a smgle bram with a diffuse alpha rhythm 
and superimposed more melodious and informative strains winding 
their way through the horns and the strings to the finale which is 
climaxed by a raucous discharge of clapping from the entire 
“lower” centers This analogy is also not unlike a huge computer 
with the conductor acting as the control and the music on the 
stands and programs in the hands of the audience functioning 
as the software or program for this particular run But there is 
more to the analogy The observer in the balcony is aware of 
intruding television cameras and microphones which probe and 
record the entire operation and then less than 24 hours later we 
su before our oscilloscopes, each in his own laboratory, and 
attempt to obtain some of the original cxpenence For the ob 
server of the oscilloscope who was also present during the original 
performance, it is obvious that a great deal of distortion has crept 
into the expenment There is a ivaxing and wamng of sound The 
striking colors of band uniforms mixed together from the entire 
state and the colorful formal dresses are now black and white and 
the clear detail is lost except on the close ups where it must be 
confessed that the oscilloscope gives considerably more detail than 
could be obtained from a comer balcony scat Some of the loss 
of the onginal signal developed m the recording and some in the 
rebroadcast We can identify some of the artifact as due to the 
periodic running of an clcctnc tram m a room above the television 
set or the over flight of an airplane There also seems to be a 
change which depends upon the characteristics of the ongmal 
program The orchestra seems to have been reproduced the 
poorest, the choral group the best and the band somewhere in 
between Perhaps the choral group has the best chance of pro¬ 
ducing a very clear and concise signal and a collection of stnnged 
instruments is more difficult to control 

All of the problems mentioned in connection with the repro* 
duction of a musical event exist in the use of electromc equipment 
to record and anedyze the electrical activity of the brain The 
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electroencephalographic tracing is a collection of the loudest signals 
circulating m some 10 billion cellular elements and the propagation 
of each individual signal involves its successive hundreds of neurons 
Every sensation of which the individual is aware from his periphery 
and his internal transducers has its counterpart in the brain 
There are, in addition, the facilitated spreads of activity through 
neuronal patterns of memory The electrodes recording from points 
on the scalp are separated from this wealth of data by a filter of 
bone and soft tissue The electronic equipment components each 
add their own impedance and distortion 

It is not surprising in view of the complexity of the problem 
that limited progress was made in analysis of electrical acuvity of 
the brain until statistical methods and computer technology were 
recently applied to the problem Human endeavor and brain 
function arc primarily statistical in character It has been said 
that God (and we might add mathematicians) study the umvene 
and have no need for statistics (or computers) Ordinary men study 
only samples of the universe and their creative energy is spent in 
a continuing research of old ground in an effort to understand 
phenomena as if they were observable in their true mathematical 
perspective Ordinary men need all of the help that they can 
obtain from computers, statisticians (and mathematicians) 

The research worker is continually searching for some unexpected 
rhythm in his rccordmgs and m the process he, of necessity, must 
lose most of the real information containing data which he calls 
“noise,” variance or “hum ” When the investigator finds some 
unexpected rhythm he immediately begins to analyze it to see 
how much information is really hidden therein, but because it is 
a rhythm and to the extent that it is repetitive, monotonous and 
redundant, it ceases to have any additional information content 
It IS a paradox that man strives continually toward recognition 
of order in the laivs of nature and yet information theory relies 
upon discovery of the unexpected *1116 unexpected is, however, 
a new predicability in rhythm and not more noise, chaos or 
entropy 

Many theoretical models are usal m an effort to explam the 
basic funcuonal design of the brain as observed m the experiments 
which computers must help process In addition to the expen- 
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mental models there are other models which we call statistical 
methods These are mathematical models and each requires cer¬ 
tain well defined characteristics in the distribution of the data in 
order that the derived statistics be valid, unbiased estimators 
Whether a data source is sending information in a normal, log 
normal, Poisson or some other distribution, if the statistical model 
specifies a normal distribution of data a transformation may be 
in order before the statistical method is applied An average or 
standard deviation is umnterpretable if the data has a marked 
skewness 

The desirable transformation should probably be derived from 
data collected from the brain during appropnate control periods 
of the experiment and this is one of the points at which judgment 
must be used by the investigator The entire application of sta¬ 
tistical methods implies repeated judgments of the research worker 
There are some statistical models that are not dependent upon 
the type of distribution and even a skewed distribution can be 
analyzed with Gaussian statistics if the spread is not too great 
Whenever the standard deviauon is as large as the mean there 
must be some question about the use of non normal, untransformed 
data Whenever possible the investigator should retain the onginal 
data or at least a portion thereof for study and display along with 
some of the derivatives and conclusions so that the extent and some 
of the nature of the loss of information can be observed 

One test of the specific analysis which is programed or wired 
into a computer is the accuracy ivith which denved statistics can 
be used to regenerate an onginal signal (without the onginal noise 
and ivith a minimum of new noise) nus can never be completely 
accomplished because of the inevitable loss of information that 
comes about m every reduction and transformation of the data 
This type of check of the hypothesis regtirding specific procedures 
of data reduction is somewhat analogous to the use of Koch's 
postulates in a bactenologic proof In bactenology the isolated 
infectious agent must be earned through cultures outside of the 
host and then the disease must be reproduced by the isolated 
agent before it can be accepted as the basic causal factor m the 
onginal disease 

In the brain messages arc received from many types of penph- 
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eral transducers sensitive to diflerent modaliti« of heat, pressure, 
vibration, pain, sound and light The signals undergo a pro¬ 
gramed analytical breakdown into essential and significant com¬ 
ponents and there is then rcsynthesis into a complex projected 
multisensonal image which seems to be at a given position in the 
four dimensional time-space that man occupies Computers are 
programed to function somewhat like their human counterpart, 
to take complex signals apart and to resynthesize them Machine 
computers are much less redundant than the brain because they 
are composed of fimte pathways and function in a nonstatistical 
manner, even though many of the problems computed arc statist 
ical A message through a machine follows along single wires and 
through isolated elements There arc a few redundancies m the 
form of checks of the computer equipment and checks for some 
of the most common mistakes that come about m human pro¬ 
graming The machine computer, however, has nowhere near the 
redundancy of the brain computer A message coming through 
the brain is not unlike the onrush of traific at a busy intersection 
when the light changes to green It involves successive hundreds 
of neurons and each neuron ts connected to a hundred more in 
a network of potential redundancy There arc thresholds, counter¬ 
acting neurons and enzymaucally controlled chemical inter 
mediancs between the neuronal synapses which help stabilize 
human behavior and make possible continued function even at 
times when relatively large segments of the neuronal pop¬ 
ulation arc completely destroyed To really progress rapidly 
ID study of brain function or cybernetics, as Norbert ^Vlcncr has 
termed the study of control in man and machmes, there must be 
multidisaplinary teamwork and pluralistic mcthodolt^ 

Because of the statistical problems involved computerized elcc- 
trocnccphalographic study will be of continued interest throughout 
the biomedical and behavioral sacnce Whereas the biochemist, 
depending upon the nature of his hypothesis, may have to work for 
months to obtain enoogb data to justify one analysis, the electto- 
cncephalographcr is literally drowned in data with the flick of a 
switch It V, ould be safe to predict that the electroencephalographcr 
will become the applied statistician supreme It is not surpnsing, 
therefore, that the clcctroencephale^rapher, m his unguarded 
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moments, refers to the problems of the electrocardiolc^t as bejng 
trivial Much of the problem up until now for the electroenceph- 
alographer has been to obtain a recogmzable signal, a predictable 
pattern within the noisy onrush of electncal activity which ong- 
inates throughout the nervous system but focuses in the brain 
Whereas other research workers have been able to use the large 
and somewhat inaccessible computers of business, the brain re¬ 
search worker has required and built special equipment for on Ime 
computer applications m the laboratory One common technique 
is to use a stimulus to the experimental subject which also tnggers 
the control section of the computer and from selected electrodes 
on the scalp the recording is shown on an oscilloscope With one 
stimulus there is no identifiable effect m the tracing, but then a 
second identical stimulus is given and the oscilloscope shows the 
average of the two tracings As the stimulus is repeated over and 
over again, the averaging of the curvilinear elcctroencephalo- 
graphic tracing becomes less and less noisy and more and more a 
definite signal which is related to the stimulus at zero tune All 
activity from the brain which is unrelated to the stimulus is con¬ 
sidered as noise to be averaged away and the operator continues 
the experiment until the signal/noise ratio reaches the desired level 
A second type of computer is designed to compare tracings from \ 
two points on the hemisphere or from both hemispheres in a 
subject or from two subjects who arc receiving the same stimulus 
This computer has a mechanism for delaying one of the signals, 
multiplying the two signals together and mtcgrating the result 
from a selected segment of the tracings The recorded brain waves 
are played over and over ivith varying time delay of one of the 
tracings in order to find the appropnate tune delay which will 
give the maximum correlation beuveen the two traangs One of 
the objects of such a study is to be able to trace the propagation 
of a signal between different recording sites 
Once signals are separated from the background of umvanted 
electncal activity it is common then to compute the amplitude at 
varying tunc intervals foUowmg a stunulus and to study the 
relationship between vanous amphtudes and the latency One of 
the problems of averaging which is also used m correlaUon studies 
IS that the subject is responding repeatedly to the same stimulus 
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which means that the experimenter is investigating a rather 
monotonous form of brain function Some of this problem is per¬ 
haps eliminated by using a randomly repositioned light spot on 
an oscilloscope and recording some additional information from 
the penphery as to the position of the individual’s eyes, the orien¬ 
tation of the head in space as well as the position of the object 
being tracked by the experimental subject Barlow, using this 
approach, averaging techniques and scalp electrodes, observed 
that It was possible to separate the electrical activity of the brain 
related to shift in spot position from that related to changes in 
eye position 

The amplitude of the electroencephalographic activity is larger 
durmg sleep as can be seen in the onginal tracings Study of the 
^ pattern of rhythmic burst activity, or the underlying alpha rhythm, 
IS less obvious in original tracings and is helped by computer tech- 
mques It has been possible to show that when an individual is 
placed m a dark, anechoic room that there is a progressive de¬ 
crease m the abundance of rhythmic burst activity which is 
detectable during as brief a penod as ten minutes It is also possible 
to drive the rhythm of the brain with a flashing light so that two 
subjects can become locked into the same phase It may be that 
this phase activity is a mechanism that the bram has for erasing 
some of the noise and preparing the bram for association functions 
required in higher thought processes If all activity is eliminated 
by some means no thinking can take place The brain is again not 
unlike electronic or inertial equipment which is easier to regulate 
or modulate m a state of motion than from a state of suspended 
animation 

It IS apparent that as more is learned there will be applications 
of computer analysis of electroencephalographic data m clinical 
applications in diagnosis The electroencephalograph has already 
been used to trigger a servomechanism for the regulation of the 
administration of anesthetic agents It is to be expected that a 
senes of statistically denved profiles for help in diagnosis of vanous 
psychiatnc disorders will be greatly facihtated by computer data 
I processing The computer then, m summary of electroencephalo¬ 
graphic work, has the double duty of assisting in identification of 
the signal in a background of noise and then once the signal has 
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been identified a second analysis is required to determine the 
amount of information available m the tracing There are special 
studies being made of neuroelectncal activity in such organs as 
the olfactory bulb and its correlation with central activity of the 
brain 

A special field of investigation is built up around electro- 
retincgraphy, recording and analyaing data obtained from the 
surface of the eye Certain parts of the observed pattern seem to 
be produced by the cones and other parts by the rods It has been 
possible to find useful differences in the patterns produced in 
patients with varying degrees of retinitis pigmentosa by using a 
light at a position in front of the patient and then taking recordings 
from both eyes with the patient looking at fixed angles to the nght 
and left These latter applications were made by Peter Guross 
and use the principle of evoked response for evaluation of visual 
function m a clinical problem 

Harold Shipton at the University of Iowa has built a computer 
which belongs to the family called toposcopes This instrument 
records simultaneously from sixteen or more electrodes attached 
to different areas over the surface of the scalp, automatically com¬ 
putes correlograms from each combination of electrodes and dis¬ 
plays these on sixteen cathode ray tubes so that time is represented 
by the continually moving clockwise radians and the point of 
correlation m time can be related between points on the skull 
through visual inspection or recorded photographically This is 
a technique used in submarine search and the instrument has some 
characteristics for information recovery that exceed the predicted 
and that have not been explained This type of instrument exploits 
the complexity and volume of signal available in the brain and 
adds a broader geographic or topologic dimension to correlation 
in time 

There is another type of computer use that ivill be mentioned 
m greater detail in the next chapter where systems, feedback and 
transponder functions are reviewed but which deserves special 
attention in relation to electroencephalography The brain is the 
Center of control for many body systems, for internal homeostasis 
and for proper onentation of man in his external environment 
An important aspect of this is feedback and Shipton, working 
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with r L Golla and W Grey Walter and others at the Burden 
Neurological Institute, has also been an early contnbutor to this 
area through design of “trigger” circuits which allow the experi* 
menter to introduce positive feedback and to throw the brain into 
oscillating activity which can be used for identifying patients with 
“epileptic tendencies” as well as for investigative studies 
This “tngger” circuitry functions to receive, smooth, amplify 
and transmit basic patterns of the brain waves so that a stimulation 
signal such as a flashing light can be fed back to the subject in 
any desired time delay relationship to recorded ongoing brain 
rhythm Normal subjects or patients with only minor psychiatnc 
disturbance report that such a feedback flashing light causes 
varying sensations Some report a subjecuve sensation of change 
in color of the light Others note a sensation of movement or 
“swimming ” Mood of the patient influences the type of response 
of a given individual It has been suggested by Golla, according 
to Shipton, that long exposure to such sumulation might succeed 
m effecting a permanent change tn the mental state of a subject 
If electroencephalographic analysis is analogous to attempting 
to isolate an instrument in an orchestra while watching a delayed 
television broadcast, then electrocardiographic analysis u analogous 
to listening to a high fidelity stereophonic tape recording of Heifitz 
The first human electrocardiogram was recorded by a London 
physiologist, Augustus D Waller, who introduced the term 
“electrocardiogram” into science The first clinical mvestigauon 
was earned out by Thomas Lewis in England An important early 
paper was published in 19J3 by Einthoven, Fahr and de Waart 
and was titled “On the Direction and Manifest Size of the Vari¬ 
ations of Potential in the Human Heart and on the Influence of 
the Position of the Heart on the Form of the Electrocardiogram ” 
The heart is a muscular organ situated in the center of the 
chest It IS a paired organ, the nght side receiving blood from 
the body and projecting it into the lungs, while the left side receives 
blood from the lungs and propels it out into the systemic or general 
body circulation The upper two chambers are called atria and 
receive the blood The blood then passes into the more muscular 
ventricular chambers There is a septum between the two sides 
of the heart and the site of origin in electneal depolarization which 
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stimulates the rhythmic beat of the heart is located in this midline 
septum There is a smo-auncular node (SAN) ^vhere the beat 
normally begins for propagation over the auricular walls and this 
sumulates contraction and propulsion of the blood into the 
ventricles The impulse then begins at the auriculoventncular 
node (A V N) and passes down the septum between the two 
ventncles in the special conductive bundle of His and thence out 
over the walls of the ventncles to cause con traction of the ventncles 
The heart rests on the dome of the diaphragm in such a way 
that with deep diaphragmatic breathing the heart is caused to 
rotate within the chest The electrical impulse probably passes 
through the heart in what can be thought of as an eccentric double 
layer of negative charges on the inner surface of the chambers and 
positive chaises on the outer surfaces with depolarization repre¬ 
senting the propagation of the impulse The impulse is propagated 
from the heart out through the body to the periphery where it can 
be recorded with attached wire leads and a stnng galvanometer 
Emthoven thought of the signal as approximating what would 
anse from a single dipole located m the region and positron of the 
heart or a heart vector In 1912, George Fahr recorded heart 
sounds simultaneously with electrocardiograms and these and 
other studies revealed the relationship of the P-wave to contraction 
of the auricles and of the QRS and T-waves as related to the 
depolarization and repolarization of the ventricle 

It was apparent to these investigators that the position of the 
heart influenced the form of the electrocardiogram, that heart 
rate was probably related to vagal tone, and that there ^vas an 
assoaated vanaUon in the pathway of excitation with changes m 
rate They observed that hypertrophy of the cardiac muscle caused 
changes that were similar in many respects to the changes caused 
by shift in position of the heart One of the most common ab¬ 
normalities of aging man is coronary artery insufficiency and 
occlusion of various branches of the arteries with consequent im¬ 
pairment of conducuon Sometimes the infarction mvolves a 
branch of the bundle of His and so called bundle branch block 
occurs Occasionally complete heart block is present Also there 
may be abnormal placement of the site of ongm of the impulse 
m the auricle There is also a variety of disturbances in rhythm 
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It has been customary to attach electrodes to the extremities 
and to use an additional electrode placed at various positions over 
the chest for some of the readings and to record varying com- 
binations of these electrodes Jbr a total of some twelve standard 
leads The voltage is recorded against time on paper The analysis 
is made by measuring the heights of various components and the 
widths and distances between these components as well as studying 
the shape of the waves For certain diagnoses some of the leads are 
much more valuable than others The reader has some information 
about drugs that the patient is receiving, especially digitalis 
preparations or other drugs which might affect the electrocardio¬ 
gram and also usually has the blood pressure recorded and a 
little information as to the type of problem which the clinician 
ordering the electrocardiogram is looking for The cardiologist 
a^so IS famihar with the common artifacts vvhich must be dis¬ 
regarded as noise such as 60 cycle current and muscle tremor 
Although some measurements are made, these arc rather inexact 
and most of the interpretation is qualitative rather than quan¬ 
titative 

To all of the other variables wJijch affect the interpretation 
there are the added variables of the complex human electro- 
cardiologist such as training, experience in reading the tracings, 
alertness or fatigue, and emotional bias Validation or feedback 
15 haphazard so that the cumulative accuracy of the opinions is 
impressionistic A good elcctrocardiologist does have a vast amount 
of informauon from carefully controlled studies in which the 
deviations from normal have been studied m select groups with 
autopsy confirmed abnormalities 

There have been careful studies made with models such as the 
one of Brody and Romans where electrically conducting material 
m the form of sheets of paper were cut out in the shape of a human 
body with a dipole constructed in the position of the heart and 
additional superimposed sheets of paper representing the heart 
tissue, lungs, abdominal viscera, etc There have been studies with 
recordings from within the esophagus and withm the heart itself 
and within a bronchus near the heart 

Nevertheless, when studies have been made of observer inter- 
pretauon of electrocardiograms, there is good agreement among the 
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cardiologists only on the most marked abnormalities There is 
surprisingly wide difference of opinion m interpretation of some 
of the tracings Epstein studied the diagnoses of five different 
observers using 451 records from young Spanish soldiers and found 
that the individual observers even made behveen t\^o and mne 
reclassifications of their own original readings These variations 
were due more to interpretations than they were to objective 
descnption of the cntena for diagnosis 

Segall studied the reports of twenty physicians using a set of 
100 electrocardiograms and observed “the degree of vanation in 
opinion IS greater than I had anticipated m any group of men 
who work together and attend conferences together ” There ivere 
twenty patients in the group known to have myocardial infarction, 
but One of the readers diagnosed myocardial infarction twenty-one 
times and another forty-nrae times There tvas a raajonty of agree¬ 
ment in 77 per cent of readings 

In a similar study by Davies with nine readers^simtlar results < 
were observed ^vlth complete agreement m only one-third of cases ' 
On the average Davies’ readers disagreed with one out of eight 
of their onginal reports Davies concluded that the overlap of 
normal and abnormal tracings is much greater than is generally 
realized 

Acheson observed that the only diagnosis which was uniform 
was that of atrial fibrillation ^Thcrc appeared to be a continuum \ 
between normal and abnormal electrocardiograms and between j 
the various different diagnoses made of disease Acheson felt ^ 
that perhaps the safest opimon to render was Simply that the 
electrocardiogram showed coronary heart disease rather than 
trying to separate left ventricular hypertrophy, mj ocardial ischmia 
or myocardial infarction 

—In contrast to this, carefully programed computer analysis is i 
devoid of the day-ta^ay vanation which is inherent tn the spot type ' 
of human endeavor The programs are written and rewntten byv 
experts in electrocardiology, mathematics, statistics and computer ' 
technology, each individual adding his experience and each defect 
being corrected as it becomes apparent in the results of the analyses ' 
By virtue of these corrections the program is heuristic Another 
reason for the increased accuracy of the computer reading is that 
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many more measurements of amplitude and latency arc made 
together with the complicated mathematical analyses which would 
^ be prohibitively timc-consurmng and also liable to additional error 
\ if done by hand The interpretation can be printed out in any 
' desired detail with some of the common measurements customanly 
reported TTie results of the analyses can be stored m computer 
memory along with information derived at a later time about the 
final diagnoses and outcome so that probabilities can be derived 
for each diagnosis Tlie conclusions arc qualitative and arc, there* 
for, ideal for incorporation into larger probability equations con* 
taming data from other sources Pipbergcr has stated “From results 
evaluated so far, it appears that the diagnostic accuracy of the 
described computer mctliods considerably exceeds that of con* 
ventional procedures used in clinical practice ” 

Clmicians who feel concerned about the intrusion of computers 
into the analysis of electrocardiograms should be reminded that 
1 the electrocardiographic machine is already an analog computer 
I and that data reduction begins m the sampling and lead selection 
Since the output is in anal^ form and the analyses are earned 
1 out with stored program digital computers, one of the firtt require¬ 
ments IS analog to digital conversion 'Die continuous line is scanned 
at selected intervals, the potential difference measured and con¬ 
verted to a number which can be recorded, usually on magnetic 
tape, but in a digital code The computer is then programed to 
identify some standard part of the tracing to provide consistency 
in the analyses 

_, Dificrcntiation is used TTie change in successive voltages is com¬ 
puted and this dcnvaiivc, the time rate of change of the signal, 
IS used to determine the point of the curve where the greatest 
negative rate of change takes place, or the R*wave The program 
jidfentifics other waves in Tclation to this point Up until this time 
abnormalities of rhythm have been excluded from analysis and 
separate programs arc bemg written Jt is necessary, therefore, 
to cither prescreen the tracings or else to have instructions incor¬ 
porated in the program to identify tracings with abnormalities 
in rhythm \V7icn programs become available for analy’ses of dis¬ 
turbed rhythm the two types of programs can then be placed in 
the computer and the program designed to automaucally shift 
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into the appropnate analysis, whether it be for regular rhythms 
or abnormal rhythms The power output is determmed by inte- 
gratmg the curves » 

Standard analytical techmque has been programed by Caceres, 
Steinberg and others as well as by Gustafson, Daniel, White and 
Staples (for children) Initial definitions of normal wave forms 
according to Caceres were arbitrary, but as precise as could be 
made from the clinical criteria usually used As expenence is 
gamed ivith the analyses these definitions have been revised and 
made more precise Sixty-cycle noise and skin and muscle elec¬ 
trical activity are eliminated by smoothing with a fitted nine point 
least square parabola The measurements of the identified wave 
forms are compared with the standard values and the logic in 
the program uses the results of these comparisons to instigate 
appropriate printing of statements as to the mteirretation Sta¬ 
tistical tables of vanance arc used to determine probabilities which 
are also incorporated in the diagnoses No analytical techniques 
are used that are unfamiliar to the physician who is used to reading 
these tracings except for the statisucal addition Even with this 
standard approach Caceres reports that the reproducibility and 
precision of measurement are much greater with machine analysis 

Gustafson observes that computer anal>sis is even of greater 
value m working with children m view of the changes that occur 
coincident with growth and development and the effect that this i 
has on the cntena for diagnosing disease 

Pipbeig^er has observed that if only frequencies up to 200 counts 
per second were used it would be necessary to take 12,000 voltage 
readings for each second of recording if t\velve standard leads are 
used There has long been a feeling that the multiple standard 
leads contained much redundant information and that if the heart 
IS analogous to a dipole in three dimensional space that it should 
be possible to get all of the available information by recording 
three leads, each perpendicular to the other two 

Scher, Young and Meredith studied this problem with factor 
analysis This is a statistical approach based m a matrix of simple 
correlations designed to determine the number of unique param¬ 
eters in a large number of variables They recorded between eight 
and thirty-six simultaneous leads in sesenteen normal persons. 
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The conclusion was that there were no significant voltages at the 
body surface in normal individuals which could be ascribed to 
more than three internal generators This does not prove that the 
heart can be considered as a dipole or heart vector, but it does 
suggest that all of the important information can be obtained by 
use of three orthogonal leads 

Frank, and Schmitt have both described ways of obtaimng these 
leads Frank uses one lead near the back of the neck, a standard 
left leg electrode and a senes of leads around the chest at about 
the fifth interspace Two of these are lateral Two are antenor 
With one of these making an angle of 45 degrees with the left 
lateral lead The one posterior lead is on a line from the nght 
anterior lead through the center of the chest 

By appropriate placement of resistors in the wiring and com¬ 
binations of these leads, a recording of three orthogonal or un¬ 
related vectors is accomplished and Frank feels that these arc 
optimum for the necessary compromises that must be made 
between a sound theoretic basis, accuracy, reproducibility, signal* 
to-noise ratio and speed of application The lead placement 
corrects for differences m torso shape and avoids the variation m 
left arms between individuals It is relatively insensitive to van* 
ation in the location of the vetitncle due to variation m the dia¬ 
phragm position or other causes 

This was the climax of some five years of intensive theoretical 
and expcnmcntal investigation and was an attempt to correct the 
many knoirn errors m existing methods of recording which seemed 
necessary before advances could be made in the quantitative area 
of analysis The great advances, both in the laboratory and clin¬ 
ically, %vill come from wide application of vector geometry and 
matrix algebra and various techniques referred to in mathematical 
name dropping as vector analysis, Fourier, harmonic and tensor 
analysis, use of orthonormal exponentials, etc 
y A vector is like an arrow, it has both direction and length 
The propagation of the clectncal activity in the heart can be 
thought of as an arrow originating at one point and directed 
normally antenorally down and toward the left By the use of 
the three unrelated (nght angle or orthogonal) leads it is possible 
to measure the Q|RS vector and to use the Pythagorean theorem 
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which says the square of the hypotenuse is equal to the sum of the 
square of the sides If then the squares of the three vectors are 
added together and the square root obtained the result is a measure¬ 
ment of the spatial magnitude of the Q.RS vector If a similar 
measurement is made for the T-wave it is then possible to cal¬ 
culate the angle between the QRS and T-waves 

If the analytical technique is earned further and so-called polar 
vectors and eigenvectors are used it is possible to represent the 
propagation of the electrical impulse in the region of the heart, 
not as an arrow but as a vector loop which moves around in a 
plane at right angles to the polar vector If looked upon from the 
side the vector loop is shown to vary above and below the plane 
to give the appearance of a figure eight The polar vector identifies 
the orientation of the plane in space and the eigenvectors and their 
ratios indicate the configuration of the loop Vector analysis allows 
study of the directional and scaler (or power) components sepa¬ 
rately and combined 

Pipberger studied fifty-six patients who had histones of old 
myocardial infarction and found abnormalities m the power (or 
scaler) analysis m 43 per cent of the cases Abnormalities in 
direction (or vector) components were observed m 95 per cent 
of the cases and when both amplitude and direction were used 
abnormalities were found in 98 per cent of the cases It was possible 
to study separately the QjRS complex and T-waves as well as the 
combination of these components which is called the spatial 
ventncular gradiant 

Pipberger carried out another study comparing the standard 
twelve lead electrocardiogram with the three, corrected, ortho¬ 
gonal lead tracings using analyses by physicians and found that 
there was a difference of 7 3 per cent of cases in favor of the t\\ elve 
leads as far as recovery of information was concerned However, 
this was related to the clinician’s ability to extract the informauon 
and was not from information loss in the three orthogonal leads 
When the recordings of the three orthogonal leads were used to 
resolve twelve leads similar to what clinicians usually look at there 
was then no difference whether the clinicians studied the onginal 
tracings or the tracings resolved from three orthogonal leads 

Even computers have limitations in their storage capacity and 
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there is a continual effort to avoid redundancy, to determine how 
much information is really available m any original signal and 
to obtain this information in a manner which will be useful to 
the clinician Spatial vector analysis would seem more acceptable 
once It becomes generally available than the twelve lead tracing 
In the case of left ventricular hypertrophy, for example, Pipbergcr 
has found that the progressive cardiac enlargement is associated 
with rotation of the polar vector in an anterior and inferior direc¬ 
tion He has also studied 134 cases of myocardial infarction one 
or more years after the condition was first diagnosed in a patient 
After the tivclve month interim a twelve lead electrocardiogram 
permitted diagnosis in 70 per cent of the cases In eight per cent 
of the cases the electrocardiographic readings were not definite 
enough to give a diagnosis and in 22 per cent of the cases the 
electrocardiogram indicated no abnormalities m the patients 
studied These same cases submitted to diagnosis by the more 
elaborate analyses were diagnosed 100 per cent as having had 
myocardial infarction Because orthogonal leads have a much 
smaller normal range tlian conventional leads, there is a better 
separation between normal and abnormal and this leads to greater 
accuracy m detecting the abnormal heart conditions Pipbergcr 
has analyzed over 3000 electrocardiograms using several mathe¬ 
matical curve analysis methods and liopes in the future to be able 
to narrow the number of different types of analyses Even with the 
multiple analyses Pipberger’s computer program is able to analyze 
700 to 800 records per hour 

Young and Huggins arc using a program which determines 
orthonormal exponentials allowing each electrocardiographic lead 
to be represented by twelve numbers Since there are three ortho¬ 
gonal leads this makes a total of (hirty-six numbers Nine addi¬ 
tional terms arc used to represent the direction cosines that 
describe the rotation of the heart in space, making a forty-five 
dimensional vector Standard multiple dimension linear regression 
analysis is then applied and each subject is assigned an additional 
attribute vector according to the diagnostic category For example, 
the attribute vector of a normal subject is (1,0,0,0,0,0) Other 
groups of patients having left ventricular iiypcrtrophy, right 
ventricular hypertrophy, left ventricular conduction defect, right 
ventricular conduction defect and infarction were each assigned 
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unique attnbute vectors The cat^onzation of disease was earned 
out by a physician It was possible to accomplish perfect diagnoses 
m the fifty-three subjects studied by this program, but the number 
of patients studied was admittedly inadequate to judge the use¬ 
fulness of this approach 

Enslein has also recently suggested the application of pattern 
recognition methods to biomedical problems, particularly the 
interpretation of x-rays and electrophoretic patterns, but also the 
reading of electrocardiograms This approach is not too dissimilar 
from that of Young and Huggins TTic only requirement as far 
as the analysis is concerned is that the program be trained hy^ 
assigning arbitrary numbers to each of the disease classifications 
and then that an appropriate number of knoivn cases of each of 
the diseases be shoivn to the computer so that when that pattern 
IS later shoivn as an unknown the various exponentials ivill be 
correct to give the designated answer Thus, after training of the - 
program, nght ventricular hypertrophy might have been given 
an arbitrary number five, normal electrocardiograms the number 
one, etc and then when an unknown is shown if it is normal the 
number one should show up as the ansiver It has just been an¬ 
nounced by one of the computer companies that they have de¬ 
veloped a scanner which wiU identify pnntcd letters and numbers \ 
and which is called a Multi font optical character reader It was 
observed dunng study of the scanner that the computer linked 
ivith this device performs some functions that have not been 
worked out completely The early scanning rate is fifty characters 
a second, but the potential speed is hundreds of characters a 
second It is hoped that this approach and type of equipment can 
be used in language translation and there is no reason ivhy the 
translation of Russian to English should be any more difficult for 
pattern recogmtion techniques than the translation of electro¬ 
cardiographic inscriptions to printed diagnoses 

It IS very obvious in the pattern recognition approach to com- ^ 
puter analysis of electrocardiographic tracings that the physician’s 
chief duty is to provide very accurate diagnoses of the patients ^ 
^vhosc tracings are used for traimng the program This is equally \ 
true of the other analytical methods, but it becomes less obvious 
when the computer is programed to simply carry out the standard | 
analyUcal techmques that are now m use Validation with the 
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necessary statistical analysis of results com6s from careful follow 
up of patients and on going revisions of diagnoses and corrections 
of the computer programs in use as the experience accumulates 
There have been studies of large numbers of supposedly healthy 
young men which showed relatively large numbers of interesting 
and supposedly important electrocardiographic findings, but much 
of the value of some of these studies is lost in that no long time 
follow up has been carried out 

( If all a physician ivishes from a computer is an electrocardiO' 
graphic based diagnosis then there will be programs available to 
provide this with the accompanying probability plus a list of 
differential diagnoses with them respective probabilities This will 
not prevent the physician from studying the electrocardiographic 
tracing Schmitt has emphasized the additional use of computers 
in combination with human visual perception and possibly even 
accompanied by a loud speaker output so that the physician may 
listen to certain parameters of the tracing while he watches a 
stereoscopic view of the vector loop in three dimensional space 
Schmitt has developed ways in which a stereoscopic projection 
can be rotated and viewed from different directions in three 
dimensional space with a cathode ray oscilloscope 
Abildskov, Ingerson and Hisey call attention to the deficiency 
of vector cardiograms as recorded on oscilloscopes as far as the 
temporal relationships arc concerned and have developed a 
sphencal co ordinate system which describes a point on the sur¬ 
face of a sphere The vector is then presented in terms of magnitude 
and two angles Such displays might be helpful for study of selected 
patients with suspected posterior myocardial infarction Perhaps 
some of these visual techniques can be used to work out the prob 
lems that arise when heunstically programed computer programs 
for pattern recognition come up with information that cannot be 
classified It seems as though more attention should be given, at 
least in some laboratories, to the on line analysis of electrocardio 
grams m the fashion of the clcctrocnccphalographers (even though 
the problems of elcctrocardiology may be “trivial” by comparison) 
No doubt the electrocardiographic laboratory of the near future 
will contain equipment to record plionocardiograms, peripheral 
pulse contours, respiratory patterns, respiratory gas analyses and 
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other information all simultaneously and there seems to be no 
limit to the computing that could be done 

The electrocardiographic traang like brain waves serves as a 
source of biologically meaningful signal which is used m system 
feedback as a transponder function This is detailed in the next 
chapter, but brief mention will be made here Once there is 
identification of a segment of the tracing such as the R-wave then 
this can be used as a timing signal to activate peripheral equip¬ 
ment which IS required to function in a particular phase of the 
cardiac cycle In patients with recent coronary artery thrombosis 
and myocardial infarction or in patients with hemorrhagic, trau¬ 
matic or bacteremic shock, who have developed a significant com¬ 
ponent of pump failure and need assistance with their circulation 
until the heart can recover, it may be possible to assist with a 
vancty of mechanical devices Thus a patient may receive “arti¬ 
ficial circulation” just as he may receive assistance by “artificial 
respiration ” 

Claus and recently Harken and Lefemme have investigated the 
use of a large cannula and aspiration of blood from a large artery 
durmg systole (while the heart is pumping blood out into the 
arteries) and then blood is pumped back into the same artery 
during cardiac diastole (that penod when the heart is relaxed 
or not pumping) Thus the heart pumps against a very low re¬ 
sistance system and additional propulsion of blood is provided 
from an outside source of energy Others are using an intraaortic 
counter-pulsating balloon The most appealing idea for ease of 
application and safety if it can be developed is to use either the 
lower extremities or the intrathoracic contents as a second heart 

Osbum, Russi, Sale, Bramson and Gerbode have shown that 
both methods are possible Both are timed with the electrocardio¬ 
gram A pressure suit is used for the legs and the suit is pressured 
dunng diastole of the patient’s heart so that the patient’s diastohc 
blood pressure is actually raised above systolic and cardiac output 
m dogs has been increased by 30 per cent When the pulsation is 
used m the airway the timing is regulated so that the external 
pulse nse is synchronous with caniiac systole and the endo¬ 
tracheal pressure is droppied during diastole ^vith no change m 
mean ainvay pressure and no mtcrference with the normal res¬ 
piratory rhythm 
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When should computer use he considered^ Automation in research 
should not be undertaken vvitliout good reason The initial steps 
are prodigiously expensive if equipment is not already available 
and always consume sufficient time and energy as to make the 
results seem illusory for the small one-run problem and often 
diversionary m relation to the original question These objections 
are minimal if the problem involves use of standard, already avail¬ 
able equipment and programs Even the experienced user may 
hesitate to automate a new project because of the time and expense 
required in preparation of data The reasons for automation might 
be divided into three general categories (1) The volume of data 
requiring storage, classification, arrangement into various relation¬ 
ships, and the number of simple analytical procedures, have be¬ 
come humanly impossible (2) The complexity of the analyses, 
the number of variables, the number of mtcr-related equations 
makes the study impractical without machine processing, and 
(3) The speed with which the results of analysis are required 
exceeds human capabilities 

There arc also degrees of automation in research varying from 
the use of a computer much as one might use a desk calculator to 
carrying out the entire experiment within the computer Examina¬ 
tion of the varying degrees of automation in research suggests a 
which wiU be used in presenting a number of other¬ 
wise unrelated research projects and these subdivisions will be 
taken up m the following order (1) data housekeeping and routine 
statisucal analyses, (2) on-lmc computer investigator feedback, 
(3) transponder function, (4) dynamic simulation, (5) adynamic 
Simulation and (6) combinations of die above An attempt will 
70 
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be made to define these different uses, but subsequent examples 
should help also and the reader should not give up on the next 
few pages First, we need some definitions of terms On-line com¬ 
pulation means that the computer takes the data in real time and 
camcs out some or all of the necessary computations immediately 
so that the investigator may have use of the results in the experi¬ 
ment while that experiment is being run The investigator can, 
therefore, use the computer results to modify experimental pro¬ 
cedure A transponder Junction is programed ahead of time so that 
during the experiment the computer not only analyzes some of the 
data and prepares the results of the analysis for use in the experi¬ 
ment, but automatically responds to certain predetermined cir¬ 
cumstances that arise in the experiment and provides appropriate 
modification of the expenmental procedure without any inter¬ 
ference on the part of the investigator 

In dynamic simulation the computer analysis is earned out in real 
time with components designed to simulate the system under study 
A signal IS taken from the living subject or provided for both the 
living subject and the computer Tlie signal is processed in its 
normal manner by the living orgamsm and also by the simulation 
program m the computer and then the obser\ed and calculated 
results are graphed so that the investigator may check on the 
accuracy of his model Dynamic simulation is a real time expen- 
mental situation in tvhich the investigator can immediately alter 
vanous coefficients or components in the model with the living ex¬ 
perimental subject acting as a very close check on his model building 
With a transponder function the computer becomes a component 
m the experiment In dynamic simulation the computer and the 
expenmental subject may be completely separate as far as any 
automatic feedback is concerned Dynamic simulation is more than 
just computer analysis of expenmental data since the objective is 
to have the computer simulate the actions and reactions of the 
expenmental subject The invcsugator makes adjustments of the 
computer to make it a better simulation Since the expenment is 
in real time and since the expenmenter wishes to have a model 
which follows the living subject over as ivide a range as possible, 
the ongoing results will govern some of the sequence and param¬ 
eters chosen for variation by the investigator 
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Dynamic simulation is modeling with analog and experimental 
subject in parallel and with only qualitative or rough quantitave 
preconceived information as to the functional characteristics of 
components involved Continued real time human inspection is 
used as a substitute for more elaborate theory and the subsidiary 
hypotheses as to the actual quantitative and mathematical relation¬ 
ships that will be found Real time simply means that the com¬ 
putation IS completed immediately and results are made avail¬ 
able so that the real and calculated signals can be compared in 
the same recording 

Adynamic simulation is modeling in sequence, off-line, at any 
lime, tn tntro and completely within the programed computer It 
requires a complete physiologic, hydraulic, or metabolic, map with 
every stationary and kinetic component predetermined within 
acceptable limits Predictions arc made by use of the program and 
arc then checked by specific experiments and chemical or otHcr 
analyses for intermediary products or signals m accessible points 
of the living system A vanety of medical research projects will 
be outlined to illustrate in greater detail the vanation of com¬ 
puter use and at the same time some of the methodologic problems 

DATA HOUSEKEEPING 

The fint example illustrates a research project which was started 
by the National Institute of Health on January J, 1950 and has 
led to the use of computers by virtue of the large volume of data 
which must be processed The type of equipment and methods 
of use of that equipment are essentially those used in business 
Originally the data were put on standard punch cards and then 
as this became too cumbersome and with more recent availability 
of magnetic tape storage, the information has been transferred 
to tape 

This was begun as a prospective epidemiologic study of heart 
disease in Framingham, Massachusetts, a town of 30,000 popula¬ 
tion, eighteen miles west of Boston An initial random sample 
involving two-thirds of the rwidents between ages thirty and 
fifty nine was drawn and these individuals were then invited to 
participate m the study Only 68 8 per cent of these people actually 
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came for examination Some 740 volunteers were then added to 
the original selection so that the total group who ^ve^e found to 
be free of coronary heart disease at the outset was 5,127 An 
attempt was made to find out as much as possible about the 
individuals in the community who were not included in the study, 
as far as their mortality rate and development of kno\vn cardiac 
disease was concerned 

The individuals included in the study have been subjected to 
thorough initial examinations and repeat examinations at two 
year intervals, consisting of history, physical examination and 
routine laboratory tests with special studies such as electrocardio¬ 
grams, blood cholesterol levels and chest x-rays for all participants 
and more detailed analyses as indicated This would seem to be 
a nearly ideal experiment, but no experiment can ever be com¬ 
pletely prospective and perfect m design 

The early death rates in the participants and m the to^vn as a 
whole were difficult to assess because of lack of at least minimal 
data on the health status of the entire population pnor to the 
random selection of participants It is not known exactly what the 
relationship was between state of health and refusal rate of the 
residents Another question which has been hard for the investi¬ 
gators to evaluate (and is often present to a degree) is the effect 
of the experiment per se (the fact that the expenment is being 
done) upon the results Whenever any abnormality is found, the 
patient’s family physician is notified and further investigation and 
possible treatment is instituted which could be another source of 
bias 

These investigators have also made note of the concern that 
some statisticians have for using statistical procedures in investi¬ 
gations where subgroups are selected from the original population 
on the basis of certain entena instead of selection m a random 
manner Gordon, Moore, Shmtlcff and Dawber observe “A 
statistical statement must begin and must end with the probability 
model Thus when you compare the mean blood pressures for a 
coronary and a noncoronary group the probability statement you 
make refers to two sets of numbers, assumed to conform to a 
specified probabihty behavior, not to blood pressure or coronary 
disease You may decide the means of the two sets of numbers 
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are or are not the same by whatever statistical rules you have 
chosen to use This is a statistical decision When you go beyond 
this to make assertions about coronary heart disease and blood 
pressure the inference may or may not be justified, but it is not 
statistical ” 

One of the interesting questions about expenmental design which 
this study helps to answer and which may help m the validation 
of other studies of selected groups of individuals with coronary 
disease has to do with the question of the selective effect of the 
disease event or of death Looking at this from the other direction, 
if an investigator sets out to study coronary artery disease and 
gathers together all of the hospital charts of patients with this 
diagnosis, does he have a representative sample of all patients 
with coronary disease or does his sample have considerable bias 
because of the patients already eliminated by death or eliminated 
by the fact that their coronary artery disease has not yet been 
recognized’ It was observed m the Framingham study that serum 
cholesterol levels, blood pressure and weight were essentially the 
same after recovery as they had been in these same patients before 
Furthermore, the weights, blood pressure and cholesterol levels 
for men who developed new coronary heart disease were the same 
for the men who died as for those who survived So far nothing 
has been found m this study which should discourage retrospective 
study of coronary heart disease 

In a massive study of data of this sort availability of computer 
equipment encourages a much more thorough study of many more 
relationships, but also helps in studying the methodology per se, 
and at times points up some of the inadvertent bias that human 
participauon contributes For example, a study was made of the 
distribution of reported blood pressures according to the last digit 
in the blood pressure It was observed that there was a predilection 
for recording even digits and the most popular observation was 
the end digit “zero ” Sixteen hundred and fifty-one systolic blood 
pressures were observed to end in “zero”, as contrast to only one 
ending in the digit “one,” and only one ending in the digit “three ” 
For diastolic blood pressure there were 823 observauons ending 
in “zero” and none ending m the digit “one ” The digits “two,” 
“four,” “six” and “eight” seemed to be very popular and the 
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odd digits unpopular except of “five” It would be interesting 
to observe the difference between blood pressures recorded by 
human observers and by a mechanical monitor in such a study 

Computers can assist in a large study of this sort not only m 
examining the distribution of data, but in standardizing the scales 
used At first there were, for example, only the initial values of 
cholesterol for each patient, but with each successive examination, 
additional cholesterol values have been added to the over-all study 
In order to use cholesterol in companson with other measurements 
It was desirable to standardize the values by converting them all 
to standard deviations This, of course, required computing a mean 
for all cholesterols measured prior to the time of the computation, 
determining the deviation of each result from this mean and 
squaring all these differences They are then summed and the 
root mean square deviation, or standard deviation, was then avail¬ 
able so that the computer could convert every measurement from 
Its original scale of milligrams per cent to standard deviations 
Furthermore, the scale can be accurately divided into percentiles 
and each cholesterol level can be categorized as to its percentile 
rank This makes possible a more definable study of the effects 
of cholesterol on the incidence of disease This type of standardiza¬ 
tion of scores is of even greater importance when the interactions 
of several vanables arc under examination This will be gone into 
m some detail in Chapter VI 

Large intensive studies such as the Framingham study are to 
interest not only to students of cardiovascular disease, but of 
problem solvers in general who must use similar methodology 
There is conunued self criticism of vanous aspects of the study as 
well as stimulating criticism such as the paper of Lew in which 
he discusses the effects of persons refusing to co-operate m the 
study and other forms of self selection and the difficulties of inter¬ 
pretation and generalizations from the observations that are made 
This large study shares with smaller studies the need for repetition 
m other geographic areas and under other circumstances to con¬ 
firm the observations made and to determine to what extent 
generalization is justified from the results 

A computer oriented investigation is of necessity organized in 
such a way that data can be selected and compared or combined 
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With data from other studies New retrospective investigations can 
be made with minimal additional work involved since the data 
has not only been collected but is accessible An example of this 
IS the recent publication of Doyle, Dawber, Kanncl, Heslin and 
Kahn on cigarette smokmg and coronary heart disease in which 
they combined information from the Albany Cardiovascular 
Health Center and the Framingham Study This gave them a 
total of 4,120 men who were free of coronary heart disease at 
their first examination The results of this analysis revealed that 
heavy cigarette smokers experienced a three fold increase m the 
incidence of myocardial infarction and in death from all causes 
as compared to nonsmokers, pipe and cigar smokers and former 
cigarette smokere These findings, according to Lew are based on 
more reliable statistics than many of the other findings in the 
Framingham Study, but how the results eu-c interpreted has 
nothing to do with computer use 

The computer is a tool which makes possible the examination 
of data from many viewpoints and makes possible snidics which 
otherwise would not be undertaken or would be undertaken on a 
much more restricted scale Certainly computers will make the 
concept of preventive mcdiane much more important by facili* 
tatmg long'term supervision of initially asymptomatic adults in 
order to delineate those characteristics which are most likely to 
be associated with high risk of certam diseases The practicing 
physician svill be better able to identify susceptible individuals 
He Mill have a better means of estimating the magnitude of the 
risk It may even, we might hope, be possible to diagnose and 
counteract epidemics of disease, even chronic disease, if the per¬ 
tinent information can be obtained and appropnatc counteracting 
measures discovered 

It appears so far from the Framingham Study that cholesterol 
IS a better predictor of coronary heart disease than other lipids 
that were measured Systolic blood pressure is a valuable predictor 
Whether or not there is true interaction between some of these 
factors, as far as prediction is concerned, a combination of two 
or three of the abnormal variables studied, becomes quite highly 
predictive of overt coronary disease 

It IS not necessary to institute a lai^ scale study and to collect 
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thousands of multivanant observatiom in order to find applications 
of computers If there is a computer center available, standard 
library programs for a variety of statistical analyses are ready for 
use For example, Hidalgo, Nadler and Bloch, used an electromc 
digital computer and a regression analysis program to determine, 
from data collected on some nmety-two men and sixty-three 
women, the best fit for the relationship of blood volume to height 
and body weight The regression analysis program will be dis¬ 
cussed further in Chapter VI 

The most intriguing applications of computers are in problems 
which are too complex for man’s limited capacities of storage and 
association Dayhoff and Ledley have developed a senes of seven 
programs for use in protein biochemistry Proteins are long chain 
polymers of ammo acid molecules There are only about twenty 
ammo acids, but each protein is different because of the order in 
which these acids are arranged All enzymes and cell structures 
are dependent upon the arrangement or the coding of ammo acids 
in protein The ammo acids m turn are produced on a template 
of nucleic acid matenal which is also coded A real understanding 
of life, inheritance, evolution, growth, development and cellular 
metabolism sooner or later leads back to the question of how these 
protein molecules are put together Because of the coding prob¬ 
lem, the study of protein structure is even of interest to the com¬ 
puter industry 

Ordinarily chemists have laboriously and rigorously analyzed 
chemical compounds by determining the atomic arrangement of 
each part and by successive degradations and chemical reactions 
With the development of x-ray defraction analysis and the appear¬ 
ance of electronic calculating machines it was possible to make 
more rapid progress Pauling has desenbed the stochastic method 
by which oi^anic chemists have studied structure and in which 
the chemist makes an educated guess as to the structure on the 
basis of the information at hand and then proceeds to carry out 
a few experiments to check specific points regarding his con¬ 
jecture This has been difficult with the protein molecules because 
of their size The original protein molecule is usually broken 
down by enzymatic hjdrolysis into shorter chain structures of 
ammo acids called peptide fragments and then the chemist must 
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conjecture as to the sequence of these fragments Some of the 
protein molecules in a sample are broken at different points in 
the chain from other molecules so that there is an overlap in the 
peptide fragments There are certain ammo acids which occur 
infrequently or in unusual relationships 
Dayhoff and Ledley constructed a model of a protein using 
different colored “pop” beads to represent ammo acids The exact 
arrangements of the “ammo acids” in the model were recorded 
and then the model was randomly fragmented into peptide-like 
fragments and these fragments were also listed The data denved 
was then used to test the senes of programs which were wntten 
to determine the possible ways m ivhich the original molecule 
could have been arranged to give nse to the fragments observed 
The program took into account the possibility of some errors m 
peptide analysis and ways in which these errors could be elimi¬ 
nated The program was wntten so that the input and output 
would be alphabetic and mnemonic Three letter abbreviations 
were used for each ammo acid Intennediate results of the analysis 
are printed out so that the biochemist can study the sequence of 
syndiesis that the probability theory program is going through in 
order to reassemble the molecule It is possible to insert special 
rules into the program regarding certain end of the chain groups 
that arc known to exist for the protein molecule by virtue of studies 
of the first ammo acids that come off during hydrolysis 

The programs are thus 'vntten so that the biochemist can and 
does actively participate in the computer analysis Just as the 
biochemist has a very elaborate amino acid analyzer to help 
determine the fragments zs they come apart during hydrolysis he 
may use the computer as another tool for the theoretical resynthesis, 
but the chemist remains a very important part of the system and 
It IS unlikely that the structure of proteins could be determined 
through computer use by a completely automated program 


ON-LINE COMPUTING 

On-line real tune computation with the investigator making 
immediate use of the results to further modify the experiment has 
been discussed in the previous chapter m relationship to electro- 
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encephalographic special purpose computers Zimmer has de- 
scnbed this application m a psychophysiologic study of autonomic 
nervous system reactions The autonomic nervous system is, as 
Its name implies, part of the more or less automatic regulatory 
system of the body which provides homeostasis and it is affected 
by emotional states Zimmer records respiration, heart rate, 
peripheral circulation, skin conductance, muscle action potential, 
body movement and reaction time This information is recorded 
along with data concerning the time and duration of stimulus 
presentation and the onset and termination of re enforcement 
symbols 

A computer program takes the nearly 9000 bits of data which 
are collected in the course of thirty seconds following a single 
stimulus and summanaes this m what is called a Lacey’s Autonomic 
Lability Score This is recorded as an index of the subject’s reaction 
and is related to the prestunulus baseline This sounds a little bit 
like what the lay person thinks of as a he detector Experiments of 
this character might reveal the least number of types of observa¬ 
tion which might be useful m monitoring a patient’s emotional 
stability and response while he took a Cornell Medical Index and 
a Minnesota Multiphasic Personality Inventory, but away from 
"cloud nine” and back to the illustrations 

TRANSPONDER FUNCTIONS 
It was mentioned in the last chapter that respiration had been 
observed by Einthoven and fellow workers to influence the electro 
cardiographic tracing The elevation of the diaphragm lifts the 
heart and rotates it with each respiration which has an especially 
marked effect on spatial vector cardiograms If there are three 
types of signals mixed together from the conduction system of the 
Vieart, from the efJect rf dnange m position o5 the diaphragm, and 
from interfenng noise of sixty cycle current and muscle tremor, 
and if the relationships between breathing and cardiac cycle are 
irregular, it becomes very difficult to obtain data which allows 
analysis of the effects of respirauon on the electrocardiographic 
tracing The transponder function solves this problem by keeping 
track of the electrocardiographic wave sequence and signaling the 
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patient when to breathe and also timing the recording of the 
electrocardiogram so that repeated tracings arc superimposed and 
averaged As each succeeding curve is averaged into the previous 
one the pattern becomes progressively clearer as the unwanted 
noise IS averaged out 

Transponder /unctions are important considerations in the 
development of patient monitonng since they may be used to 
eliminate the recording of large volume of normal data, but can 
be set to institute recording as soon as abnormalities appear 
Schmitt has suggested that transponders be considered for appro¬ 
priate institution of treatment under certain circumstances One 
might consider, for example, the addition of a transponder function 
to the cardiac pacemaker in a patient with interrmttent complete 
heart block so that as long as the patient’s cardiac rhythm was at 
an approximately normal rate the pacemaker would be inactive 
but as soon as the normal rhythm was interrupted the cardiac 
pacemaker would take over 

DYNAMIC SIMULATION 

Warner has used an electric analog of the carotid sinus m which 
the computer acted also as a component m the experiment, taking 
recordings of arterial pressure from the carotid sinus and feeding 
additional stimulation back into the vagus nerve so as to simulate 
and exaggerate carotid sinus regulatory function They found that 
the carotid sinus was important in dcterming steady state levels 
of mean blood pressure, but did not respond to rapid fluctuations 
in pressure 

Clyncs has studied the respiratory heart rate reflex and its 
change with age and has suggested that permanent records could 
be maintained for individuals, collecting additional data over the 
years and watching for the normal changes that occur with aging 
as well as unexpected changes which might herald the develop 
ment of some disease process at a stage before it would otherwise 
be recognized 

The eye has proven to be a very common subject for study of 
biologic^ servo systems Stark has studied the pupil area by 
measuring the reflection of invisible infra red light with changes 
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m intensity of visible light as the stimulus The ciliary musculature 
IS a very accessible and responsive muscle to observe ivith only 
one degree of freedom in its motion Considerable is known about 
the anatomy of the pupil and reflex which involves the light path, 
optic nerve, the lateral geniculate body and some fibers ivhich go 
directly to the bram stem and are relayed from there to the 
Edinger-Westphal nucleus which m turn contains parasympathetic 
nerve cells whose fibers can be traced to the ciliary ganglion which 
then controls the ins muscle There are other nerve cells which 
send fibers through the sympathetic nerve system through the 
supenor cervical ganglion and cause dilatation of the pupils 
Three vaneties of light stimulation are used When a Made beam 
is shone through the pupil the amount of light reaching the retina 
is decreased as the pupil constricts which is called a closed loop 
type of stimulation since, as the pupil constnets, there is less light 
hitting the retina and, therefore, less stimulation to cause con¬ 
traction of the muscle This is the normal closed loop feedback 
mechanism A second type of stimulus (open loop) is to shine a 
very narrow beam of light so that, although the pupil is stimu¬ 
lated to constnet, this constriction causes no decrease in the amount 
of light reaching the retina A third type of stimulus is to shine a 
narrow beam so that it impinges on the edge of the pupil and 
very slight contraction of the pupil will have a very marked effect 
on the amount of light passing into the eye Thus there is a very 
high gain effect in the feedback mechanism By use of an appro¬ 
priate stimulus the regulatory system for control of light entering 
the eye may be studied as a normal closed feedback, an open loop 
with no feedback, or with high gain (exaggerated) feedback 
Other workers have used computer techniques to study the servo¬ 
mechanism controlling eye movement in tracking a moving ob¬ 
ject or in maintaining a fixation upon an object Mhile the head 
and body are moving 


SIMULATION 

Simulation need not be on-line It can be completely tn vitro 
with separate laboratoiy living system expenments done later to 
test the results predicted by the computer program Simulation 
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ofTers some of the greatest chaJJengcs m computer applicatio 
since It ofTcrs a means of testing liypothcscs about the inte 
relationships of system components as well as study of the con 
ponents themselves Warner and Cox used analog computer simi 
lation of the mathematical model of the heart rate control b 
sympathetic and vagus nerve function and compared the con 
putcr model with anesthetized dogs In building their model the 
took into consideration information gathered from other invest! 
gations regarding the concentration of norepinephrine in it 
storage sites in nonmcdulatcd sympathetic nerve fibers, certaii 
characteristics of its release, the concentration of circulatin] 
norepmephrme in the blood, certain characteristics of the sino 
auricular node in the heart which controls heart rate and 
postulated substance m the node which theoretically must rcac 
with norepinephrine in order to change heart rate This mvolvec 
postulation of some six different rate constants The model pre 
dieted that raising the concentration of norepmephnne in cir 
culating blood would have no effect on the maximum heart rat< 
achieved by sympathetic stimulation, which was confirmed in the 
experimental animal They also set up a similar model of the 
vagus nerve function and its discharge of acetyl choline Then 
model seemed to account for simple stimulation of either the 
sympathetic or vagus nerve, but could not account for efiecti 
observed when both forms of stimulation were used 
Brittain Chance has written digital computer programs con 
taming equations for all known steps m the chemical reactions 
that take place for selected segments of cellular metabolism This 
allows experimentation with some of the less accessible portions 
of the simulated living cell and then when actual experiments arc 
run with the biological system, only spot checks need to be made 
to test the accuracy of the computer prediction 
There are close simdantics between the models used for enzyme 
kinetics, studies of transport of compounds between various com 
pariments of the body and for epidemiology An enzyme is often 
thought of as a specific type of small ferry boat which plies back 
and forth across the energy barrier of a specific chemical reaction, 
picking up substrate on one side and depositing the passenger 
molecules as product of the rcacuon on the other side on the 
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opposite shore Competitive inhibitor compounds of similar struc¬ 
ture to the substrate may also sit in the seats and thus slow the 
rate of transport of the substrate The same mathematical models 
are applied to the transport of chemical compounds across the 
kidney tubule and across cellular membranes throughout nature 
In epidemiology the ferry boat is not a single enzyme or a 
specific transport substance but is the bacterial agent and the 
passengers are people who have not previously had the disease 
They become infected and are earned across the nver to the 
immunity community (or across the nver Styx), but the mathe¬ 
matical models and computer programing have much in common 
A well trained and interested mathematician could find ample 
problems in biology and medicine for exercising his talents All 
biologists must make an effort to cross this bamer too, but this 
is a subject for the last chapter 
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A WORKED EXAMPLE 

The clinical investigator or medical research worker who wishes 
to use automatic data processing facilities usually feels initially 
somewhat like a child tvaiting for his parents to begin a vacation 
trip There seem to be so many preliminaries which delay actually 
getting started The final computer data processing may take only 
five or ten minutes The development of hypotheses, design of 
experiment, collection of data and preparation of data for analysis 
may require months or years Clinicians often have their en¬ 
thusiasm dampened further by the ngor of the approach required 
by automatic data processing techniques 
The work need not necessarily be mathematical, but it is at 
the least very logical and requu^ most careful definiuon of each 
step The research worker who has mastered clinical medicine and 
required portions of chemistry, physics, physiology, anatomy, etc 
may suddenly rcabze that he needs much more knowledge of 
statistics There are specialists in related disciphncs who can 
provide a certain amount of help, but the only specialist in that 
particular research project is the investigator himself and just as 
he needs to know a great deal about all of the other aspects of his 
expenment, he must also have considerable insight into the logical, 
statistical, and mathematical aspects of his work 

A narrative renew of a research prcyect ivhich required the 
use of automatic data processing: has been chosen for more detailed 
illustration of the application of computers since it involves the 
types of problems that are frequently met with in clinical research 
There will be frequent interruptions in this account in order to 
explain various statistical matters or to illustrate from earlier 
experiments why certain analyses >verc considered desirable or to 
84 
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explain aspects of actual computer programing and the details of 
computer use in this study Fust, a short resume of the research 
problem wll be presented 

Our currently complex treatment of patients with a vanety of 
illnesses and ivith extensive surgical procedures has saved many 
lives but has brought out the problem of acute renal failure which 
often adds to the patient’s illness and further complicates the 
treatment There are at present no available means of adequately 
evaluating the diagnosis and prognosis of acute kidney damage at 
a time when something specific could be done to reverse the 
process There are certain drugs that are occasionally used in the 
treatment of resistant infections which are known to be toxic to 
the kidney In spite of careful crossmatchmg of blood there are 
still occasional transfusion reactions followed by kidney failure 
There are also a vanety of acute and chronic diseases of the kidney 
which are recognized primanly m their late stages by discovcnng 
the nsmg blood urea nitrogen 

In all of these instances a more sensitive means of diagnosing 
and following active early damage m the kidney would be of con¬ 
siderable benefit to the clinician Not only would early diagnosis 
be a help but it would be possible then to evaluate the success or 
failure of a proposed treatment For example, hundreds of patients 
are being given intravenous Mannitol at the present time dunng 
and immediately after surgery to prevent renal failure, but there 
has been no specific way to estimate which patients might have 
had renal failure had the Mannitol not been giv^n 

Serum enzyme concentrations are used as indices of organ damage, 
as for example in the study of glutamic oxaloacetic transaminase 
as an index of acute liver cell breakdown or acute myocardial 
infarction The basis for these analyses rests in our concept of 
enzymes as being large protein molecules residing m cells and 
being released from cells into the blood stream or surrounding 
tissue fluids only when those cells are damaged In order for such 
tests to be useful the blood or tissue fluid sampled must have come 
into contact with the cells under question In acute damage to 
the kidney there should be loss of cellular enzymes into the unne 

The overall, kidney, enzyme losing, experimental model could 
be, of course, much more complicated than this Almost 200 quarts 
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of fluid are filtered from the blood each day by the kidney glomeruli 
and all but one quart reabsorbed by tlie tubules The blood cells 
and larger protein molecules m the blood are prevented from being 
filtered through the glomerulus into the nephron by virtue of the 
small size of the pores m the glomerular filter Some smaller 
protein molecules may slip through, especially if there is damage 
to the glomerulus 

When protein does pass into ihe nephron Jt is phagocytjzed by 
cells lining the tubule and much of it is probably removed from 
the fluid which eventually ticcomes unne Certain substances are 
excreted by the cells lining the nephron and enter the urine by 
this active contribution of cells rather than by passive filtration 
through the glomerulus It is probable that m the course of the 
excretion of certain solids into the lumen of che nephron same 
protein molecules may be lost m small amounts 

The amount of enzymatic protein which reaches the unne may 
be affected by the rate at which the fluid passes through the 
nephron, thus altering the time available for removal of a given 
enzyme molecule by phagocytosis Also jf the tubule cells are 
required to do an excessive amount of osmotic or chemical work 
there may be additional loss of enzymes The enzyme content 
of the cells js qualitatively dificrent at different levels of the 
nephron and the pattern of enzyme excretion by the kidney could 
very well be related to the functional activity of the proximal 
and distal convoluted tubules and the ascending loop of Henie 
as well as the collecting tubules Enzymes could also be lost from 
the collecting system of kidney piclvis, ureter, bladder and urethra 
There may be a certain obligatory loss of enzymes as cells deter¬ 
iorate and die and are replaced by new cells This might be 
difficult to distinguish from mild toxic effects 

Initially the urines for this study were obtained from a variety 
of normal ambulatory individuals as well as normal hospitalized 
patients and patients with potential kidney disease It was found 
that certain enzymes were stable in unne and could be measured 
with an accuracy sufficient to determine that there was a difference 
between hospitalized patients and nomial, well, ambulatory people 
Certain patients were studied in detail, collecting serial samples of 
unne and it was observed that there were curvilinear variations 
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with time which were not random and must, therefore, have some 
significance either relative to kidney function or to some aspect 
of the integrity of the kidney 

The urine enzyme levels did not seem to be related to the serum 
enzyme concentrations, suggesting that the enzymes in the unne 
came from the kidney rather than from the blood and other 
organs in the body Three enzymes have been studied extensively, 
namely glutamic oxaloacetic transaminase (GOT), beta-glu- 
curomdase (B-G) and leucine ammo-peptidase (LAP) A senes 
of expenments were carefully designed using dogs in metabolism 
cages and varying the intake of food, water and solute in some of 
the expenments and, in other expenments, adding agents ^vhlch 
are considered to be toxic to the kidney 

In one expenment where both Neomycin and egg albumin were 
injected intravenously, there appeared to be a highly significant 
increase in the loss of all three enzymes m the unne With the 
injection of egg albumin alone the nse in enzyme excretion \vas 
less striking There was additional question about the results of the 
experiment because of the occasional need to discard unne speci¬ 
mens contaminated in the metabolism cages from nonforroed stools 
immediately following administration of the toxic agents A ne^v 
expenment was, therefore, earned out and will be used for illus¬ 
tration of computer applications beloiv 

It IS known that egg albumin introduced into the circulation 
as a foreign protein causes the development of an allergic reachon 
in ammals and also that the foreign protein can be found in the 
tubules of the kidney The expenmcntal design chosen called for 
injection of egg albumin on Monday morning and unne collec¬ 
tions beginning at 0800 hours after the injection T^venty-fou^ hour 
unne collections were then made on Tuesday, Wednesday, Thurs¬ 
day and Friday mornings at 0800 hours If any emesis or diarrhea 
occurred immediately following the injection, the cage was cleaned 
before unne collection was started 

In addition the dogs were all maintained on a reduced dietary 
intake Fnday, Saturday and Sunday and until noon on Monday 
The diet consisted of kennel biscuits containing 448 calones and 
4 05 grams nitrogen/lOO grams biscuit The feedings were divided 
into t^^o equal feedings, one given at noon and the other at 4 
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o’clock m the afternoon The food was weighed out and given 
according to dog weight Earlier it was determined that the average 
dog in a metabolism cage would eat a certain amount of this diet 
when the diet was provided in unlimited quantities In the present 
experiment one-third of this ad libitum average amount was given 
to all of the dogs on tvcek ends This greatly reduced the incidence 
of vomiting and diarrhea following injection of egg albumin It did, 
however, introduce another variable 

Earlier experiments had shown that variation m food intake 
affected markedly the excretion of leucine amino peptidase (LAP) 
and to a lesser extent the excretion of other enzymes The present 
experiment was therefore designed to evaluate the eflect of vari¬ 
ation in food intake In order to do this the dogs were divided 
into tivo “C” groups Both groups were fed the one-third ration 
over each week end and group “two” was fed the onc-third ration 
during the week as well Group “one” dogs were given three- 
fourths of an average ad libitum intake on Tuesday through Fnday 
The general layout of the experiment is shown in Figure I 
The mam simple effects to be observed and outlined thus far 
in the expenment were therefore defined as (A) the influence of 
injected egg albumin and (C) the amount of diet There could 
also be some interaction between A and C For example, the 
reduction in diet in group “two” animals (C2) could make the 
kidney more or less susceptible to the effects of a foreign protein 
as measured by excretion of enzymes in the unne 
Another condition seemed to be m need of constderauon m the 
expcnmental design, namely the mfluencc of repeated injections 
of foreign protein as it might affect the sensitivity of the animal 
Tins might be manifest as an increasing or decreasing loss of 
enzymes after injection of egg albumin as the kidneys became 
more or less sensitive to the foreign protein There was also a 
possibility of another interaction or combination effect For ex¬ 
ample, the group '“one” dogs (Cl) receiving the higher food intake 
during the week might have more enzyme to lose from the kidney, 
but this effect might become Jess vinth time Thus there could be 
a three way interaction InteracUons might also be observed 
between varying sensitivity and diet alone, or betivecn the effect 
of egg albumin injection and the sequence of injections In order 
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A 



Figure 1 This u a Luidquist Type VI analysis of variance design It can be 
considered a four dimension table in which a cntenon vanable such as the 
urine concentration of leucine ammo peptidase (LAP) could be entered for 
analysis It is apparent that variation can be analyzed by dogs (S), days (A), 
treatment repetitions (B) or differences in diet for groups of dogs (C) Arrows 
show injection tunes for all dogs m both C| and Cj Shaded areas show the 
actual days when unne samples were collected Table II shows the results of a 
Type VI analysis for LAP Figure 4 shows the mean values obtained from the 
analysis Similar analyses of variance were earned out using other urme con 
stituents as the criterion variable but the results are shown only for LAP The 
analysis is based on the null hypothesis that there is no significant difference m 
the variation of the criterion vanable which can be attributed to A, B C, AB, 
AC BC or ABC influence 


to provide a factor of “influence of repeated exposure to the foreign 
protein” the experiment was replicated four times along another 
dimension which will be designated as “B ” The simple effects 
A and B and the interaction eflfects of AB, AC, BG and ABC are 
all effects in which comparison can he made wtlhtn do^ and the 
difference between animals does not need to be considered as 
affecting the analysis Each dog provides both control and experi- 
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mental data The C effect or difference between levels of feeding 
IS of necessity a between sul^ects effect Dogs are either in Cl or 
C2 and not in both 

A standard mixed design was selected which provided this com 
bination of within dogs and between dogs effects by consulting a 
summary Table [H 17, Lindquist] in a standard design and analysis 
text This particular analysis is called a Type VI design In the 
State University of Iowa Computer Center, analysis of variance 
programs, witten by Spiker and compatible with Lindquist’s text 
are available on a magnetic tape m the computer program library 
It IS usually possible to select a standard design which will fit the 
experiment This has not only the advantage of saving a great 
deal of time for the investigator, but such standard library pro¬ 
grams have been tested not only by their originators but by many 
different users With the expcnmental problem and a specific 
design outlined, the next step is to do the experiment 

Healthy mongrel dogs were randomly selected and placed m 
metabolism cages and provided with the diet as it was to be given 
during the actual experiment In addition egg albumin was given 
intravenously in a dose of 1 ml/kg on each Monday just as if the 
experiment had been started, but no urine samples were collected 
The dogs were thus sensitized to egg albumin before the actual 
expenment was begun After three weeks of allowing the dogs to 
become accustomed to the routine, unne was then collected from 
the eleven dogs m the two C groups for four days of each week 
and for three weeks making 132 samples This was called a B 
slice and the entire experiment was then repeated three more 
times so that 528 unne samples were obtained in four B slices for 
the entire expenment An estimate of the average value for any 
measurement made in these samples ivould, therefore, have 527 
chances to vary, or 527 degrees of freedom 

It would seem somewhat wasteful to spend three or four months 
collecting unne specimens and makmg the injections for the pur 
pose of answenng only one question Actually three enzymes were 
studied in the unne so tliat an analysis of vanance could be made 
for each of these three enzymes Earlier experiments indicated that 
LAP would be the most likely enzyme to be influenced by diet 
Earlier experiments with the injection of nephrotoxic agents also 
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revealed that all three enzymes were elevated in kidney damage 
Other unne analyses were earned out m order that multiple 
regression analysis could be used For example, in one earlier 
experiment which lasted 130 days, a group of thirteen dogs were 
given a varying diet, vanation in water intake and vanation in 
the sodium content of the water Multiple regression analysis then 
revealed that when GOT was the dependent variable and crea- 
timne, sodium, potassium, chlonde, phosphorus and freezing point 
depression were used as independent variables, that potassium, 
freezmg point and sodium were all important predictors of GOT 
excretion, both with concentrations of these moieties and with 
total amounts excreted per day The partial-F for potassium was 
especially high The correlation between potassium and GOT 
excretion was positive and between GOT and sodium negative 
This suggested a hypothesis that perhaps some of the GOT in 
the unne is lost from the distal convoluted tubule coincident with 
the exchange of sodium for potassium 

In other earlier experiments where sodium intake was low 
throughout, there was no correlation behveen potassium and GOT 
excretion This would be consistent with the proposed hypothesis 
since It IS known that potassium m unne remains at a lower level 
as long as sodium reaches the distal convoluted tubule in relatively 
low concentrations This does not pro\e the suggested hypothesis, 
but does illustrate the way in ivhich examination of the data may 
reveal other interesting and worthwhile avenues for investigauon 
Multiple regression analysis also has the potential of revealing 
certain relationships between those entenon vanables selected for 
analysis of variance and other variables measured in the same 
urine samples Results of regression analysis may assist in explaining 
the source of the effect of those significant factors in the expenment 
such as diet or the injection of a toxic agent If significant relation¬ 
ships are found between the excretion of an enzyme and the 
excretion of other moieties, then covariance analysis can be used 
to further delineate the relationships and may point the way to 
additional experiments which may explain in greater detail the 
reasons for the effects observed in the analysis of vanance 
The present experiment was, therefore, elaborated to include 
measurement of these other moieties, urine volume, pH, specific 
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gravity, total solute (by freezing point depression), creatinine, 
sodium, potassium, chlondc and phosphorus concentration, in 
addition to the activity of three enzymes GOT, LAP and B G 
After the entire experiment had been run, specimens collected 
and analyzed, data recorded in the laboratory book and also on 
punched cards and ultimately upon magnetic tape it would seem 
that surely now some computer analyses could be started 

Before reporting upon the actual outcome of the analysis, it is 
worth mentioning some of the underlying assumptions that must 
be fulfilled or at least considered before carrying out the actual 
computation 

The analysis of variance is a very useful statistical model, but 
in considenng this model certain conditions must be met if the 
results are to be interpreted The first assumption is that the 
entenon measure for each treatment population is “normal” or 
Gaussian in distribution The necessity for adherence to a precise 
model cannot be overemphasized Many of the models of mathe¬ 
matical statistics dealing with the mean and standard deviation 
are valid only when the population has a normal distribution 
Secondly, the analysis of vanance is based upon certain standard 
hypotheses (1) all groups were drawn from the same population, 
(2) there is no difference in the population variation m the dif¬ 
ferent groups, and (3) the population means for each group are 
the same If any of the above conditions arc not true then the 
analysis of vanance will show significant deviations from the null 
hypothesis under consideiation 

The investigator is interested in testing for significant vanation 
between groups as far as the averages or means arc concerned 
He would also like to avoid incorporating into the expenment 
other causes for “significance” such as lack of homoscedasucity 
(lack of uniform variability in the scale) or lack of a normal 
(or Gaussian) distribution Therefore, before running an analysis 
of vanance the distribution of data should be examined to see 
what type of distnbution is present If it is not normal then a 
normalizing transformation roust be considered 

There is no particular mason why one scale should be used 
in preference to another m reporting the results of a biochemical 
analysis As a matter of fact the spectrophotometer has two scales 
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to read One is called transmission and the other density The one 
IS arithmetic and the other loganthimc Here then is a trans¬ 
formation built into the very dial from which the results are read 
Therefore, any scale can be used that satisfies the requirements 
of the statistical model 

Since most of the analyses used in parametnc statistics (sta¬ 
tistics that depend upon a specific distnbution) assume a “normal” 
distribution this distnbution will be analyzed in some detail 

1 1 4b ji)« 

Any distnbution function Y = •' of this form is called 

a normal probability density function or p d f with mean, p and 
vanance, <r* The title, “probabihty density function,” implies that 
the area beneath the curve of this particular function is equal to 
one (!) and that since it is a continuous distnbution, a precise 
probability statement can be made from a segment of the total 
area Because of the extensive expenencc with this particular model 
much analysis has been made to find out what random samples 
can be made to approximate the normal curve One powerful 
example of this is denved from the central limit theorem which 
says that a distnbution of mean values tends to^vard normality 
as the sample size increases 

When making deductions of a general character from samples 
drawn at random from an unknown population the researcher 
often needs to test whether a sample of values is likely to have been 
drawn from a Gaussian parent population In devising tests of 
normality two classes of procedures have been followed— (t) a 
normal curve is fitted to the sample data and the x* test for good¬ 
ness of fit IS applied (2) Certain ratios of the moments of the sample 
are calculated and the significance of their departure from the 
expected value for a normal distribution is examined 

First consider procedure "one ” Although it has advantages it 
IS somewhat insensitive to real departures from normality This 
IS due to two chziractenstics of this test First, it takes into account 
only the magnitude of differences between the observed and ex¬ 
pected group frequencies and not their sign or arrangement and, 
second, because of the necessity of grouping, the effect of small 
frequencies in the tail of the distribution is eliminated and errors 
due to grouping of data result In the following, the concern is 
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with procedure “two” and the outcome of the specific normalira- 
tion procedure used in this example 
The statistical method for testing the assumption of normality 
using ratios of moments wll be presented before the discussion 
of the actual numerical results This mathematical approach is 
based on the moments of the normal distribution First, let Xi, - - *, 
Xw denote the random sample from a parent population From 
this random sample a test is made to determine if the sampling 
comes from a normal distnbution The first four moments about 
the mean of any population distribution are defined as follows 

PI ■= 2 

p,-:£x.*/n 

m which the deviations are x ® X — X and where the mean, 
__ JV 

X » 2^ X</N N here denotes the sire of the random sample 
Nmv Pi IS merely the mean of the deviations from the mean or as 
IS sometimes called, the first moment about the mean pj is the 
mean of the deviations squared from the mean or the variance 
p» IS the third moment about the mean and P4 is the fourth moment 
about the mean These moments distinguish certain types of dis- 
tnbuuons by certain ratios that can be formed from these moments 
When these ratios approach specific values, the random sample 
IS known to approximate a sampling distribution that is normal 
or Gaussian 

These ratios of the moments are called betas and are defined 
as follows 

01 - pVpS 
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The two ratios, and Pa serve as a means of defining departure 
from normality These ratios arc called sample statistics since they 
are denved from the random sample If the parent population is 
normal the sampling distnbuUons of these two sample statistics tend 
toward normality as the sample size increases Their means are 
zero (0) and three (3) and their sampUng standard deviations are 
approximately VS/n and ^24/0 respectively Thus any significant 
deviation from *= 0 and P* = 3 indicates that the data does 
not have a normal distnbution Furthermore, Vft ** ^ measure of 
the skewness of the sampling distribution and P* is a measure of 
the kurtosis If ts greater than 0, the distribution has longer tails 
on the right (positively skewed) and if Pi is less than 0, it has 
longer tails on the left (negatively skewed) Pj indicates the general 
form of concentration around the mean If p* is less than 3, the 
distribution appears to be less heavily concentrated about the 
mean and if Pi is greater than 3, it is more heavily concentrated 
about the mean 

In order to evaluate each vanable for its normality of distn 
bution and then to transfonn it and finally to repeat the above 
evaluations with perhaps several repetitions requires long and 
laborious computations Therefore, without the assistance of auto¬ 
matic data processing one might be quickly discouraged, especially 
wth 10 variables and 528 samples 

Since no library program was available to test the assumption 
of normahty one had to be written First the steps of such a pro¬ 
gram were outlined in a block diagram Then a senes of FOR¬ 
TRAN or formula translation statements were wntten which was 
translated by the computer into a machine language program for 
computing VA and Pi 

The samphng distnbutions of •y/fti and Pi have been determined 
by P Williams and E S Pearson, respectively From these deter¬ 
minations It has been found that if the sample is large, N>100, 
departures from normality may be tested by treaUng -V^/a/^TN 
and (pj — 3)/-^247N as normally distnbuted values Hereafter 
these two normally distnbuted values will be denoted by Zi and 
zi, respecuvely 

Figure 2 represents the flow chart to evaluate the normahty 
of each of the ten vanablcs under consideration The Fortran 



Figure 2 Thuu the flow chart ofa program designed to calculate sample atatuacs 
which can be used to evaluate the frequency distribution of a set of data 'Ihe 
actual FORTRAN statements are shovi’n in Figure 3 


Figure 3 These are the formula traratatkon (FORTRAN) statemeots that were 
used to carry out the various steps of die analysts shown in Figure 2 The results 
of such an analysis program applied to a senes of transformed cntcrion variables 
are shown la Table I 
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C BULGREN.MASON—544-0004 

DIMENSIONSX(15),SX2{15)^X3(15),SX4(15),CSX(15),CSX2 
(15),1CSX3(15), CSX4(15),TMEAN(15),TVAR(15),TMU3(15), 
2TMU4(15),BETA1(15),BETA2(15), TSIG(15),Z1(15),Z2(15), 
3X1(15),X2(15),X(15) 

D03K=1,528 

READINPUTTAPE14,1,(X(I),I = 1,10) 

I FORMAT(F4 0,2F3 0,7F4 0) 

X1(1) = LOGF{X{1)+200) 

Xl(2)=LOGF(X(2))/3 
Xl(3)*=LOGF(X(3)- 75)/2 
Xl(4)«LOGF(X(4)- 75)/2 
X1(5) = LOGF(X(5)+100 )/2 
X1(6)“X(6) 

Xl(7) = LOGF(X(7)+7 )/2 
XI (8)=LOGF(X(8)+700 )/2 
X1(9)=LOGF(X(9)+100 )/2 
XI (10)=LOGF(X(l 0)+625 )/2 
DO4J«l,10 
SXa)«SX(J)+Xl(J) 

SX2a)=SX2(J)+Xl(J)**2 
SX3U)=SX3q)+Xl(J)**3 
4 SX4(J)-SX40)+X1(J)**4 
3 CONTINUE 
P»528 

14 DO6J=l,10 

TMEANq)=SX(J)/P 

TVAR(J) = ((P*SX2a)-SX(J)*SX(J))/(P*P)) 

TMU3(J) - (P*P*SX3a)“3 ^P*SX(J) •SX2(J)+2 •SX(J) • 
1SX(J)*SX(J))/(P**3) 

TMU4(J) = (P*P*P*SX4(J)-4 •P*P*SX3(J)*SX(J)+6 ‘P* 
1SX2(J)*SX(I)*SX(J)-3*(SX(J)**4))/(P*M) 

BETAl (J) = (TMU3(J) • *2)/(TVAR(J) **3) 

BETA2(J) =TMU4(J)/(TVAR(J)**2) 

TSIG(J) = TVAR0)**5 
BETA1(J)=BETA1(J)** 5 
Zl(J)=BETAl(J)/((6/P)**5) 

6 Z2(J) = (BETA2(J)-3)/((24/P)**5) 

DO12Jl = l,10 
WRITEOUTPUTrAPE3,21 

12 WRITEOUTPUTTAPE3^22JUTMEAN((l),TSIG(Il),BETAl(Jl), 
1BETA2(J1),Z1 ( T1),Z2(J1) 

II CONTINUE 

21 FORMAT(62H VAR MEAN SIG BETAl 

1BETA2 Z1 Z2) 

22 FORMAT(I2,5H ,6F10 5) 

END 
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seen in Pigure 3 was written For the IBM 7070 Since Fortran 
IS compatible with a large number of computers, this program 
could be used by any interested reader 

FORTRAN IS a system of processing statements which are 
written in a language which is understandable to man and ending 
up ivith machine language statements which will run the com 
puter This system, which is stored in the computer, takes each 
FORTRAN statement and translates it into a senes of machine 
language instructions Such a translation was earned out to pro 
duce the machine language program needed to carry out the flow 
chart shown in Figure 2 

With this program it was possible to quickly evaluate all van 
ables to be used If a transformation was required for normalization 
some suggestion was provided as to whether skewness or kurtosis 
(that IS asymmetry, or abnormally peaked or flat configuration) 
was the problem The transfonnation used was almost always of the 


form K* — 


log (x + c) 


where c and k arc constants The actual 


values of the constants c and k were determined by trial and 
repetitive evaluation of and 0t until a normal distnbution 
was approximated 

A very important point needs to be made at this stage regarding 
the use of transformations and analysis of variance It is possible 
to transform all significance out of any set of experimental data 
The supposition on which the null hypothesis is based is that the 
experimental conditions cause no diflerence From the control con 
ditions in the criterion variable 

If one were to use the expenmentai data and transform it so 
as to have a normal distribution, then if there were any real 
diflcrences between groups it would be lost Therefore, trans 
formations of all data for analysis of variance roust be made on 
the basis of standard control data However, what is control in 
one direcCion may be expenmeofal tn another direction of an 
analysis of variance design This would appear to be an insoluble 
problem and a senous objection to the analysis of vanance method 
unless the design is such that a very well defined standard control 
set of data can be selected for each analysis 

The importance of a transfonnauon to a normal distnbuuon 
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to obtain additivity This term refers to the property that when 
two objects, containing equal amounts of an attnbute, are com¬ 
bined the third object is equal to the sum For example, the distance 
from M o to four inches is exactly the same as the distance from 
122 to 124 inches However, the area difference between a two 
and a four inch cube is twelve cubic inches, %vhercas the difference 
between a 122 inch and a 124 inch cube is 90,776 cubic inches 
This illustrates a scale in which additivity does not exist because 
the two differences added equal 90,788 and not 4 
Table I illustrates the transformation used for each variable 
Associated with each vanabic is the transformation and the cor¬ 
responding -yjpi and pt values along ivith the zi and z* which are 
the standardized normal scores described earlier Both the initial 
and transformed scale sample statistics are shown This shows the 
difference that the transformations made and how ivith the anth- 
metic values the assumption of normality may be far from valid 
In the analysis of vanance example under discussion the selected 
transformation was formulated around data collected dunng 
another (control) experiment m which normal mongrel dogs 
were eating the same type of diet and drinking water ad hbttum 
and without the administration of any toxic agents 
With the new scale the analysts of vanance was ready to run 
Since an analysis of vanance is a library program, all that is 
required to run a particular analysis is a few additional control 
instructions These instructions specify the input and output of 
the data, or the format as it is called, and the number of samples 
for each group under consideration These control cards adjust 
the library programs to the dimensions of the experiment under 
consideration by indicating the location of data to be read into 
the computer With the entenon vanabic being specified by the 
control instrucuons, all the other vanables stored in the data cards 
or on magnetic tape will be ignored 

We will now consider for purposes of illustration the specific 
computer center which contains our library routine The State 
University of Iowa Computer Center is at present chiefly a closed 
shop That is one in ivhich special operators are required to 
operate the equipment This specific electronic digital computer 
IS run by means of a monitor system with magnetic tapes as the 
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only input-output The monitor is an automatic means of telling 
the computer to run in a specific and economic manner so that 
many different projects can be processed in one run A smaller 
computer is used for all the auxiliary functions associated with 
handling the data such as transfemng data from punched cards 
to magnetic tape, pnnting the contents of the magnetic tape and 
for some of the preliminary sorting of the data m order that input 
time for the larger computer will be as short as possible 

All that the user must do is to provide the center with the 
information as to the library program desired, the control state¬ 
ments or cards, and the appropriate set of data cards for the 
specific library program Twice daily the output is made available 
in the form of pnnted sheets of results If more rapid access to the 
results is needed, priority may be achieved for the investigator 
by contacting or actually working with the operator The results 
come from the small scale computer (IBM 1401) by way of 
the printer attached to it The pnnter yields tables covering sheets 
approximately twice as lai^ as typing paper at a rate that will 
frighten the most avid seeker of results A computer can produce 
enough results m an hour to keep the research team busy for 
another six months Many people only use the computer for a 
few minutes at a ume and many use it only once a month or less 
unless they are analyzing data from some ongoing experiment and 
have laige volumes of data 

The standard library program that was used for this particular 
analysis was the Type VI analysis of vanance Only the analysis of 
LAP ivili be presented The output from this particular analysis 
was pnnted in the form of a tabic with the sums and means across 
for row column slices, row slices, column slices for group 1 dogs 
and then for group 2 dogs and then row column sums and means, 
column sums and means, rmv sums and means, slice sums and 
means and finally a summary table showing degrees of freedom, 
sum of squares, mean square and F ratios Table II illustrates the 
preceding by showing the pnnted output of the LAP analysis of 
vanance from the computer The investigator may look for any 
“wild” values in these tables and if he finds some ivild values he 
can then check back in the ongmal data for errors in recording, 
such as misplaced decimal pcants The standard table of F-raUos 
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TABLE ZI 


Type VI Anova CnnuioM •• }Loo (Lap — 6 5)J/2 


Group C] High feed 

Row—Column—Slice Sums 

6 OB 5 10 

5 74 

6 43 

7 05 

6 53 

8 10 

7 70 

6 97 

6 92 

6 60 

6 97 

5 89 

6 75 

5 90 

7 11 

7 45 

6 59 

6 33 

6 86 

6 89 

6 46 

6 37 

5 98 

5 98 

6 26 

6 09 

7 12 

7 U 

6 32 

6 84 

6 55 

9 98 

7 28 

7 49 

8 05 

6 38 

6 79 

6 36 

6 63 

6 10 

4 98 

5 14 

5 20 

8 15 

5 96 

5 B2 

5 26 


Row—Column—Slice Means 

1 01 0 85 

0 96 

1 07 

i 18 

1 09 

1 35 

1 28 

1 16 

1 15 

1 10 

i 16 

0 98 

1 13 

0 98 

1 18 

1 24 

1 10 

1 06 

1 14 

1 15 

1 08 

1 06 

1 00 

1 00 

i 04 

1 01 

1 19 

1 19 

1 05 

1 14 

I 09 

i 66 

1 21 

i 25 

1 34 

1 06 

t 13 

1 06 

1 11 

1 02 

0 83 

0 84 

0 87 

1 36 

0 99 

C 97 

0 88 


Marginal Statuuca 

Row—Slice Sums 

80 19 

78 58 

85 09 

72 77 




Row—Slice Meau 

1 n 

1 09 

1 18 

1 01 




Oo]uinn—Slice Sums 

24 33 

24 90 

24 09 

27 29 

27 74 

24 41 

26 41 

26 31 

32 00 

26 62 

26 28 

26 26 



Column—Slice Means 

1 01 

1 04 

1 00 

1 14 

1 16 

1 02 

1 10 

1 10 

1 33 

1 It 

1 09 

1 09 



Group C] 

Row—Column—Slice Sum* 

5 93 5 49 

4 77 

4 43 

4 72 

4 55 

4 54 

4 49 

4 83 

5 12 

4 49 

4 64 

4 18 

5 31 

5 07 

4 45 

4 33 

4 01 

3 74 

3 es 

4 86 

4 18 

4 12 

3 93 

4 TS 

3 76 

3 38 

4 17 

4 59 

4 44 

4 62 

4 21 

4 66 

4 07 

5 18 

3 77 

4 40 

4 54 

3 22 

3 83 

4 51 

4 51 

4 64 

3 19 

5 61 

4 69 

3 96 

4 08 


Row—Column—Slice Mean* 

1 19 1 10 

0 9S 

0 89 

-2 

0 94 

0 91 

0 91 

0 90 

0 97 

1 02 

0 90 

0 93 

0 84 

1 06 

1 01 

0 89 

0 87 

0 80 

0 75 

0 77 

0 97 

0 84 

0 82 

0 79 

0 84 

0 75 

0 68 

0 83 

0 92 

0 89 

0 92 

0 B4 

0 93 

0 81 

1 04 

0 75 

0 88 

0 91 

0 64 

0.77 

0 90 

0 90 

0 93 

0 64 

1 12 

0 94 

0 79 

0 82 
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TABLE H iCord’d) 

Marginal Statistics 
Row—Slice Sums 


58 00 

52 04 

51 03 

$1 18 




Row—Slice Means 

0 97 

0 87 

0 85 

0 85 




Golunui—Slice Sums 

18 71 

15 74 

19 10 
19 97 

16 43 
18 06 

16 87 

17 75 

18 14 

16 41 

17 52 

17 54 

Column—Slice Means 

0 94 0 95 

0 79 1 00 

Marginal Statistics Kt Cl, R1 C2 

0 82 0 84 

0 90 0 89 

R4-C12 

0 91 

0 82 

0 8$ 

0 88 

Row—Column Sums 

12 01 

10 59 

10 51 

10 86 

11 77 

11 08 

12 64 

12 19 

n 60 

12 05 

n 10 

11 61 

10 07 

12 06 

10 97 

It 56 

11 77 

10 61 

10 07 

10 71 

11 76 

10 64 

10 49 

9 90 

10 IS 

10 01 

9 47 

11 28 

11 73 

10 76 

11 45 

10 76 

14 64 

11 35 

12 66 

n 82 

10 78 

11 33 

9 58 

10 46 

10 61 

9 49 

9 78 

8 39 

13 77 

10 65 

9 78 

9 34 


Row—Column Means 

1 09 

0 96 

0 96 

0 99 

1 07 

1 01 

1 IS 

1 11 

1 07 

1 10 

1 01 

1 06 

0 92 

1 10 

1 00 

1 05 

1 07 

0 96 

0 92 

0 97 

1 07 

0 97 

0 95 

0 90 

0 93 

0 91 

0 66 

1 03 

1 07 

0 98 

1 04 

0 98 

1 33 

1 03 

1 15 

1 07 

0 98 

1 03 

0 87 

0 95 

0 96 

0 66 

0 89 

0 76 

1 2S 

0 97 

0 89 

0 85 


Marginal Statutia, Cube 

Column Sums 

43 04 

42 05 

43 99 

51 97 

40 52 
44 68 

44 16 

44 03 

45 88 

42 67 

41 94 

43 95 

Column Means 0 98 

0 96 

Row Sums 138 19 

1 00 

1 18 

130 62 

0 92 

1 02 

136 12 

1 00 

1 00 

123 94 

1 04 
0 97 

0 95 

1 00 

Row Means 

1 05 

0 99 

1 03 

0 94 




Slice Sums 

316 63 

212 25 






Slice Means 

1 to 

0 88 







provides the basis for detemuiung the statistical results of prime 
interest 

An F-ratio is a measurement of vanation in a selected dimension 
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of the cxpenment This ratio should approximate or approach 
unity if the factor designated docs not cause any vanation in the 
cntenon variable An F-ratio is the ratio of two x’s, each divided 
by their respective degrees of freedom In an analysis of vanance 
design Chi square ( x*) is the sum of squared deviations from the 
mean divided by the vaianrce The variance is the mean square 
deviation of the criterion variable from the mean of the cntenon 
vanable It should be evident from these definitions why the test 
IS called an analysis of vanance 

In discussing the results of the anal>^ of variance first consider 
the summary table The degrees of freedom indicates the number 
of opportunities within the designated dimension of design that 
the cntenon variable may actually vary For example, there were 
eleven subjects so degrees of freedom (d f) for subjects is 10 That 
is after the mean is known and ten of the values for subjects arc 
known then the last subiect is also known There arc, therefore, 
only n-1 degrees of freedom As another example, AB interaction 
has 11 X 3 degrees of freedom since there are twelve A columns 
and four B rows This is evident from the fact that 12 — 1 =11 
and 4 — 1=3 The d f are needed to look up m a standard 
F-table the inequalities required for a chosen level of significance 
as will be mentioned below 

The sum of squares (ss) is an abbreviation for the sum of the 
squares of deviationsofindividual observations in a given dimen¬ 
sion from the over-all mean The total vanation of a given value 
from the over all mean is made up of vanation due to the simple 
effects of A, B and C plus all the interaction effects Therefore, if 
one adds up all the separate ss it will equal the total ss in most 
analysis of vanance tables 

Tins IS not true here because a number of separate error terms 
have been included for more reliable testing of the vanous inter¬ 
action effects For example, error (w) has been subdivided into 
three components each of which is used to test a different simple 
or interaction effect Error (w) denotes the error within the groups 
The mean squares or (ms) are defined as ss divided by d f These 
are used to formulate the F-ratios of the last column Ss divided 
by the population vanance is a Chi square and an F-ratio is 
defined as a ratio of two Chi squares over their respective degrees 
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of freedom Since one of the basic assumptions of a Type VI 
design IS that the population variances arc equal for each treat* 
ment the population variances cancel out in their F-ratio 
The F-ratio is a measurement of the relative vanation between 
components as compared with the vanation within components 
For example, to test the sigmficance of the sum of squared van¬ 
ation from the mean of the 2 G columns, error B is used and hence 
we obtain 6 054/0 217 27 9 

The F-ratios must then be compared with a standard Normally 
if the null hypothesis is true (that is if vanation between com¬ 
ponents of the design show no significant difference from the 
vanation within those components) then the F-ratio should have 
a normal distnbution about umty Actually the F-ratio for the 
G dimension is 27 9 as seen from Table II According to the 
standard table for F-values, anything greater than 22 86 (for d f 
of 1 and 9) would only happen 1 per cent of the time, or once 
in a thousand such expenments, if the null hypothesis was true 
From Table II the F-ratio for A is equally significant At this 
level of probability (P < 001) none of the interactions AB, AC 
or BC are significant As seen from Table II, we are left ^v^th G 
and A as highly significant from a probability aspect 
It IS now necessary to review what the actual relationship of 
mean levels is in these dimensions of the experiment and what 
was designed onginally so that some conclusion may be drawn 
relative to the physiologic meaning, if any, of these findings 
Since the data were transformed the means must be examined 
m the transformed state It is of importance to note what the 
transformation did to the data as compared to the original scale 
Some transformations will invert the scale In this instance the 
scale is still in the same direction and the lower concentrations of 
LAP were reported by lower means in the transformed data used 
in the analysis of variance Sec Figure 4 for the relationship of 
means 

Now Jet us see if we can translate the analysis into simple words, 
that IS wth respect to the onginal condiuons and all the steps 
taken This is where some judgment is required as to ho%v much 
generalization is justified Certainly the results do not indicate 
what would happen m man if he were given a foreign protein 
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Fi^vu-e 4 Shows the mean values for urioeleucuie amino pcpQdase (LAP) plotted 
m arbitrary units on the ordinate and days of the week on the abscissa Tunes 
of Intravenous e^ albumui injecaoos are indicated by the Monday arrows 
Group Cj dogs (d in number) were fed more food on days Tuesday through 
Friday Group C] dogs (S in number) were fed the same restricted diet on all 
days that C| dogs received on weekends The means for all dogs are shown by 
circles Cl, crosses and Cj, dots 


or an altered foc5d intake It can probably be said with considerable 
assurance that LAP excretion in the urine of dogs is reduced by a 
reduction in food intake and that regardless of whether dogs are 
eating a nearly normal ration or a greatly reduced ration, injection 
of egg albumin will cause an increase in the excretion of LAP 
If LAP excretion is influenced by diet and by injection of egg 
ivbite intravenously, is the latter effect to be interpreted as a toxic 
effect or a dietary effect’ Simultaneous analysis of GOT (glutamic 
oxaloacetic transaminase) and B G (beta glucuronidase) revealed 
no significant effects of either diet or egg albumin injection In 
earlier experiments where neomycin was injected there was a rise 
of all three enzymes In circumstances where no nephrotoxin is 
given there is very poor correlation between the excretion of LAP 
and the other two enzymes From these observations the authon 
believe that the nse in LAP excretion after intravenous egg albumin 
in doses of 1 ml/kg and after the animals are sensitized, is not a 
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toxic effect It is more analogous to the effects of diet and suggests 
that perhaps the important vanable in the diet is the amount of 
protein Thus another hypothesis is made and new experiments 
ivill be required to test them 

However, this is not the end of data processing in this illustrative 
expenment If urinary LAP was lower with restncted food intake, 
was there a change in some concomitant moiety in the urine that 
tends to be excreted with LAP^ A multiple regression analysis 
was run by means of a standard library program to find out what 
significant relationships, if any, could be found LAP uas used as 
the dependent vanable 

Table I shows the transformations developed wth the lUustra 
tive experiment It is to be noted that for sodium no transfoimation 
was used in multiple regression analysis because the data ivas close 
enough to Gaussian as it was recorded in the data book Text 
books and papers wniten with statistical treatment of data often 
have the statement, “It is assumed that the data have a normal 
distnbution *’ This statement is often erroneously interpreted to 
mean that the assumption can be made without testing it Just 
because a variable has a Gaussian (or normal) distribution in 
one set of data does not mean that it will be Gaussian m another 
expenment For example, Figure 5 shows a set of sodium data 
in which the mode was close to zero The transformation required 
was ^ where is the f (x) and x is sodium 

When this same transformation was applied to the egg injection- 
expenmental data under discussion the resultant distnbution vs as 
almost as skewed but in the opposite direction 

It IS easier to decide what to do about scales in multiple regres 
sion or correlation analysis because all data may be used for 
determining the transformation required In order to obtain more 
of a feeling for the need of transformations let us consider the 
rationale behind a regression analysis A regression analysis is a 
method of fitting the data to the function of two or more variables 

A common and general method used in estimating a population 
regression curve from sample data is the method of least squares 
The curve to be estimated is “assumed” to be a straight line 
There are several ways to test this assumption of hneanty, but 
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Figure 5 Shows a transIbnnaQoo iron a tcait o! mijhequjvajrais per Jjter to a 
sew fcale chosen because tc would provide a closer approximaDOD to a Gaussian 
trequency distribution 


they will not be discussed here Briefly then, the sample regrcssiaa 
curve of y (dependent variable) on x (independent vanable) of 
given degree is the curve among all those possible of that degree 
that minimizes the sum of squares of vertical (y) deviations of the 
observed points from the curve In the linear case, the degree of 
the curve is one 

By means of a multiple linear regression it is possible to deter¬ 
mine whether several variables, xi, , xt, differ in their eflect on 
a dependent vanable, y Data gathered for regression analysis may 
be classifled into cither a type 1 problem or a type 2 problem 
The type I problem occurs when the effect on y may 6c measureij 
for specific selected values of the variables xi, , x*, chosen by 
the expenmenter The type 2 problem occurs when the points 
(xi, , Xi, y) are chosen at random from a (k 4- 1)—dimensional 
normal distnbution of points in the (xi, , x», y) plane In either 
case the y value observed is considered to be a random observation 
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from a normal population of possible y’s for each particular x used 
These y populations are assumed to have a common standard 
deviation The assumption of normality is necessary for a sig* 
nificance test and confidence intervals in regression analysis 
Therefore, it is necessary to fulfill the basic underlying assump¬ 
tion of normality for the dependent variable in the type 2 
problem 

The multiple regression analysis program was wntten to do the 
analysis as described by Ezekiel and Fox This program is also 
a standard computer library program The output from this pro¬ 
gram was first a table of simple correlations betiveen all variables, 
a multiple regression coefficient, the standard error, and the 
associated F-ratio Finally a table of partial correlation coefficients, 
partial regression coefficients and the associated partial F-ratios 
IS given 

With respect to the specific illustration expenment, the first run 
With 6 variables revealed that potassium and total solute con¬ 
centration were important predictors of LAP while sodium, phos¬ 
phorus and chloride concentrations were not It was observed, 
however, that if a nonparametnc term was added to account for 
the effects of injection on LAP concentration, the multiple-R 
increased from 188 to 294 and the corresponding muItipJe-R 
F-ratio increased from 3 83 to 8 23 This was accomplished by 
assigning a number 3 to each sample collected on Tuesday, 2 for 
Wednesday, 1 for Thursday and 1 for Friday This is a means of 
weighting the days in relation to the Monday injections of egg 
albumin This new variable was called “injection factor ” 

A third regression analysis was then run with potassium, total 
solute (AT) and mjecuon fector as the independent variables This 
analysis will be denoted as Analysis I The multiple-R fell from 
294 to 277 and the corresponding F-ratio rose from 8 23 to 14 49 
with the smaller number of variables selected 

The regression coefficient squared is called the coefficient oj deler- 
mmation Thus (294)* == 086 or 8 6 per cent of the observed 
variation in LAP could be projcctoi from the independent vari¬ 
ables when all seven were used and only 7 7 per cent when the 
data from potassium, freezing-point-depression and injection factor 
were used alone The 7 7 per cent predictability of LAP is more 
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significant, however, because it is accomplished with a smaller 
number of terms 

The fourth and last regression analysis run for this expenment 
^vas the same as Analysis I except that two more independent 
vanables were added The first variable added to the regression 
equation was creatinine, since H was known from previous analyses 
that It should increase the muItiple-R and the corresponding 
F-ratio The second variable added to this equation was “GOT ” 
GOT should be lost from cells when they are damaged and to the 
extent that LAP loss is a toxic or cell damage effect of the egg 
injection there should be correlation This analysis will be denoted 
as Analysis 11 With these two independent variables added, the 
multipIe-R increased from 277 to 371 and the corresponding 
F*ratio increased from 14 49 to 16 69 The coefficient of deter¬ 
mination now IS 0 137 Almost 14 per cent of LAP vanation in 
this expenment can be related to vanation m GOT, creatinine, 
potassium, total solute and “injection factor ” 

For illustrative purposes, the pnnt-outs obtained from the corn- 
puter are reproduced in Table III and Table IV These two 
regression analyses are from Analysis I and Analysis II respectively 
The “echo check,” found at the top of Tables III and IV, is a 
listing of the first set of variables from the experiment These 
values have all been transformed by the obtained normalizing 
factor in order to fulfill the assumption of normality The trans¬ 
formations used in both analyses are LAP » [log(X — 75)]/2, 
Cr = [log (X + 100)1/2, GOT * [logX|/3, K = Pog (X + 7)1/2, 
and AT « [log (X 625)1/2 The X denotes the anthmetic 
values of each random variable “Injection factor” is a term 
representing the time-related introduction of the foreign protein 
In a sense the analysis is one of autocorrelation It is somewhat like 
the evoked response coirelogram type of study used in electro¬ 
encephalography only instead of lodung for brain wave correla¬ 
tion dependent upon time the present analysis searches out 
relationship of LAP excretion to time of introduction of the 
foreign protein and to any time-related excretion of other moieties 

The mean values for the analjrais of each variable are listed m 
the pnnt-outs along with “population standard deviation,” which 
is an estimate, and “standard deviation” which is the value ob- 
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served considering the data as a universe and not as a sample 
Again, these are averages and standard deviations calculated from 
the transformed values 

This still perhaps may be a stumbling block to some How can 
you add all values and divide by the total number if the values 
have been changed from their onginal form by taking their 
logarithms’ Adding the logarithms of numbers is the same as 
multiplication of the original values The answer to this is that 
the onginal scale is no longer of interest 
The transformation was for the purpose of providing a new 
scale in ivhich the distnbution of sample data would be Gaussian 
Ha\nng changed to this scale, the analysis is earned out with the 
new values and the results will be interpreted in terms of the new 
scale The new values can be added and a mean detemuned, and 
the mean will be applicable only to the new scale 

If one wished for some reason to see where this mean ^vouId 
fall on the old scale it would be necessary to take the antilog and 
go through the other manipulations indicated by the transforma¬ 
tion used for that vanable This would then be a sort of geometnc 
mean of the onginal data The calculation of standard deviation 
for this "geometric mean” would require adding and subtracting 
the standard deviation from the mean using the values in Tables 
III and IV and then taking the inverse of the transformation 
for all three values 

To actually extract and interpret values from the regression 
analyses earned out with a transformed scale and to try to deter¬ 
mine their value m the onginal non-Gaussian scale is dangerous, 
if not impossible, and is without purp<»e One scale is as good as 
another The investigator should not be so emotionally attached 
to one scale that all thinking must be done in that scale Actually 
the statistical analyses lead to ratios or pure numbers \vhich are 
Without any scale except for distnbution frequency 

The correlation matrix is a table of simple correlations with 
every vanable correlated ivith every other variable The highest 
correlation coefficient obser> ed in the experiment is between total 
solute and potassium ( 75) In Anal>'Sts I, the best simple cor¬ 
relation with LAP appears to be the "injection factor” term, with 
a correlation coefficient of 0 21 In Analysis II there is a slight 
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TABLE in 

RzauasiOH Analysis I 


No of Observauons «• : 

528 


No of Varuble* 4 

Echo Check 






0 9039 0 9375 I 5692 

t 0000 



ygrubh 

Mran Pap SD 

S/ Dip 


LAB 

1 02 

0 24 

0 24 


Potaiitum 

0 82 

0 15 

0 1$ 


Tool Solute 

1 57 

0 05 

0 05 


Inj Factor 

1 25 

0 60 

0 60 


CemUtun Matrix 

LAP 

PaUsiium Tatal Selale 

Tnj Faetor 

LAP 


1 DO 

0 02 

0 12 

0 21 

pQUutum 


0 02 

1 00 

0 75 

0 00 

Total Solute 

0 12 

0 75 

1 00 

-0 08 

Ibj Factor 


0 2t 

D 00 

-0 08 

1 00 


MU R 

$lJ Eftcr F 

DF Dip Var m LAP 


27678 

0 230M 14 49052 524 



B 

BeU 

F 

Part R 


Potauium 

-0 30 

-0 20 

9 36 

-0 13 


Total Solute 

I 33 

0 28 

19 34 

0 19 


Jnj Factor 

0 09 

0 23 

29 24 

0 23 


A 

-0 93 






negative correlation between creatinine and LAP which would 
suggest, but certainly docs not prove, that LAP comes from kidney 
tubule cells in contrast to creatinine which comes from glomerular 
filtration There are few conclusions that can be drawn from simple 
correlations but the investigator will learn from experience and 
by study of such tables in repeated experiments that there are 
certain moieti« that are usually correlated and this background 
of knowledge may occasionally allow him to pick out some unusual 
findings in a pancular experiment 

In addition, if very high correlation coefficients are observed 
it suggests that these two variables are probably measuring the 
same basic factor The simple correlation matrix is used in the 
computer library program for the multiple regression analysis and, 
therefore, represents results of part of the analysis m progress 
The final results are summarused in the line which lists the 
multiple regression coefficient, standard error of the multiple 
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TABLEIV 

Regression Analysis II 


No of Observations 528 


No of Variables = 

= 6 

Echo Check 






0 4263 0 9039 

1 3116 

0 9375 

1 5692 

1 0000 


VanahU 

Mean 

Pafi SD 

St D/p 



GOT 

0 37 

0 07 

0 07 



LAP 

1 02 

0 24 

0 24 



Creatinine 

1 29 

0 10 

0 10 



Fstassium 

0 82 

0 15 

0 15 



Total Solute 

1 57 

0 05 

0 05 



Inj Factor 

1 25 

0 60 

0 60 



Condatfon Matrix GOT 

LAP 

Cteatuane Potassium 

Total Solute Tnj Factor 

GOT 1 00 

0 24 

0 24 

0 34 

0 34 

0 03 

LAP 0 24 

1 00 

-0 07 

0 02 

0 12 

0 21 

Creatinine 0 24 

-0 07 

1 00 

0 65 

0 57 

0 00 

Potassium 0 34 

0 02 

0 65 

1 00 

0 75 

0 00 

Total Solute 0 34 

0 12 

0 57 

0 75 

1 DO 

0 08 

Inj Factor 0 03 

0 21 

0 00 

0 00 

-0 08 

1 00 


Mlt R 

Std Error 

F 

DF Dtp Var 

• UP 


0 3712S 

0 22281 

16 69 522 



B 

Beta 

F 

Part R 


GOT 

0 75 

0 23 

26 67 

0 22 


Creatmine 

-0 41 

-0 17 

10 43 

•0 14 


Potassium 

-0 23 

-0 15 

4 73 

■0 09 


Total Solute 

1 26 

0 27 

17 77 

0 18 


Inj Factor 

0 08 

0 21 

26 02 

0 22 


A 

-0 63 






regression coefficient, F-ratio for multiple-R, and degrees of free¬ 
dom for the regression analysis (Tables II and IV) The multiple 
“R” IS the correlation coefficient for the dependent variable (LAP) 
observed values compared with the values which would be pre¬ 
dicted from the observed independent variables of creatinine, 
GOT, potassium, freezing point (AT), and injection factor for 
each unne sample using the derived regression formula 

In regression analysis, the standard mor S^/u i is a good 
estimate of the scatter of the peculation about the line or popula¬ 
tion regression plane. If the deviations from the plane are normally- 
distributed, about 95 per cent of the points in the sample tvill he 
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\vithm =*=1 96 S„/ij t of the plane These deviations are meas¬ 
ured m the y (dependent variable) direction In Analysis 11 
(Table IV), S»/ii i = 223 whereas = Si^p = 239 If (he 
deviations are approximately normally distributed, about 68 per 
cent or 359 of tJie 528 observed values of LAP should he within 
223 of the values predicted from the corresponding independent 
variable values by the regression equation It also appears in 
Analysis II that the predicted value for LAP has about ^e same 
standard error as the observed standard error for LAP deter¬ 
minations in this experiment 

Only a restricted interpretation can be made from the mode! 
The variation observed has been determined by the way in which 
the experiment was conducted The correlation model applies here 
only in so far as future experiments are earned out m exactly 
the same way The experiment seems more one of multiple cor¬ 
relation than of multiple regression since no variables were fixed 
except the “injection factor” which is an independent variable 
An F-ratio of 16 69 can be evaluated for the multiple correlation 
model m the same way that it is used for the variance model 
The F-ratio associated with the muUiplc-R is used to test the 
h^'pothesis that the population muItipIe-R is zero Since F =» 16 69 
With 5 and 522 d f is larger than the cntical value F ss (5,522) = 

2 21, we reject the null hypothesis and conclude that the regression 
of LAP on GOT, creatinine, potassium, freezing point and in¬ 
jection factor does account for a significant amount of variation 
observed in LAP 

Another manner of observing the amount of the variation in 
the dependent variable related to the chosen independent variables 
IS by obtaining the coefficient of determination, which is equal to 
r* In the Analysis II, (0 37)’ approximates 14 per cent Fourteen 
per cent of the LAP lanatjon can be accounted for by the vari¬ 
ation in creatinine, sodium, phosphorus and “injection factor” 
using the observed transformed values of these independent van- 
ables and the coefficients listed under “B" in Table IV For 
example, the equation would be LAP = 0 75 (GOT) — 0 41 (Cr) 

- 0 23 (K) 4- 1 26 (AT) + 0 08 (IF) - 0 63 This equation would 
be repeated 528 times with the tramformed values inserted each 
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time and the resultant, estimated LAP would account for about 

14 per cent of the observed LAP 

The “Beta” values are coefficients which would be used in the 
equation if each of the terms were divided by the standard deviation 
for that term 

The F-values listed and the partial correlation coefficients have 
about the same significance for individual terms as the muItipIe*R 
and F-value listed for the over-aU equation The difference here 

15 that the degrees of freedom for the partiabR’s arc 1 and 522 
The coeffiaents of determination can, however, be obtained by 
squanng each partial r Thus it appears that an injection factor 
partial r of 0 22 gives a D of (0 22)* or 4 8 per cent Therefore, 
almost one»third of the predictability of this equation m deter¬ 
mining LAP IS dependent upon the term “injection factor ” 

It stUl cannot be decided whether the LAP loss which occurs 
in relation to injection of egg albumin is a “toxic” effect or perhaps 
an effect of increased protein in the body similar perhaps to the 
dietary effect observed in the C dimension of the analysis of 
vanance Even if GOT represented a toxic effect in the 5 per cent 
coefficient of determination, there is still 95 per cent of LAP 
variation that could be nutritional It is evident that the analytical 
problem here may ^vcll be autocorrelation Factor analysis might 
be a possible answer to this but has not yet been used for this 
experiment A statistical model that has been tried ivith this 
experunent is the covanance model 

In covanance analysis the Ci and C* groups were analyzed as 
high and low feeding groups with LAP as the dependent vanable 
and with GOT, creatinine, potassium, frcczmg-pomt-depression 
and injection factor (IF) as independent variables We know that 
there ^vill be a significant difference m the LAP means betiveen 
Cl and Ct This was determmed m the analysis of vanance It is 
also knoNvn for this expenment that all of the above independent 
vanables are sigmficant predictors of LAP because of the results 
of regression analysis The question, however, is to find out how 
much nutntion influences the slt^ of LAP as predicted by the 
independent vanables If GOT and injection factor arc more related 
to the toxic effect they might not be influenced by the marked 
difference in food intake of the two groups of dogs, Cj and C* 
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GOT as a cntenon \ariablc in the Type VI analysis of vanancc 
showed no significant variation due to diet or injection of egg 
albumin From the regression analysis (Table IV) where the 
variables GOT and creatinine were added, the F-ratio associated 
with GOT IS much more significant than the F-ratjo associated 
Nvith creatinine This is also seen from the two partial-“R”s 
Two covanance analyses were run, one with LAP as dependent 
and potassium, GOT, total solute, creatimne, and IF as inde¬ 
pendent variables and the other with crcatimnc deleted Both 
results showed significant difference of regressions The results 
with creatinine included arc shown m Table V 

GOT and “injection factor*’ arc not interchangeable tenns in 
spite of having similar coefficients of determination m the muluplc 
regression analysis The significance of the regression analysis comes 
from data collected from the Ci group of animals The stnking 
effect of food intake on LAP excreuon influences the regression 
analysis by simple reduction in LAP excreDon The results also 
su^iest that there is minimal “toxic” or damage effect of injecting 
intravenous egg albumin and that perhaps the mam stimulation 
of LAP excretion is “nutritional ” 

The generalizations that can be made from these analyses of 
one experiment must be limited Because of other experimental 
data and also because of the highly significant results observed in 
the C dimension of the Type VI analysis of vanancc and in the 
difference of means of the covanance analysis it is safe to predict 
that m future experiments in dogs nutntion will be an important 
condition to consider in the design when LAP is the cntenon 
measure under study However, nothing can be said about the 
LAP excretion in man 

The mam thir^ to be learned from this example is the jios 
sibilitiy of appljing different statistical models to a set of data 
when the expenment is well designed The final interpretation 
is dependent upon a combmed interpretation of all models plus 
what has been learned from other expenments, what is known 
about the actual organ sy-stem under study and information from 
the literature 

The data processing is only m part looking for significant sta¬ 
tistical differences The first objective is the stimulus to thinking 
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DifTerencc of Regressions 
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Soutci ej Vartaiian 

Dogs 

C 

Error B 
Withui dogs 

A 

AC 

B 

BC 

AB 

ABC 

Error (w) 

Error 1 (w) 

Error 2 (w) 

EnrtJr 3 (w) 

Tsbl 


SUMMARY TABLE 
DF Stm ef Sfaarei Squeiet 


10 8 003 0 800 

1 6 054 6 054 

9 I 950 0 217 

517 26 093 0 OSO 

11 2 052 0 167 

11 0 S40 0 076 

3 0 923 0 308 

3 0 703 0 234 

33 2 60S 0 079 

33 2 377 0 072 

423 16 592 0 039 

99 3 58S 0 036 

27 4 766 0 177 

297 8 242 0 028 

527 34 097 0 065 


F Ri/wx 


27 9 


5 2 
2 1 
1 7 

1 3 

2 8 
2 6 


that comes from orderly assemblage aod reduction of data and 
the variety of viewpoints The second objective is to consider the 
requirements of each model and to be aware cf the distortion and 
erroneous results that may come from applying Gaussian statisucs 
to non^normal data or data that lacks homoscedasticity No smgle 
experiment will be cruaal Expenence with numerous related 
experiments improv es the ability of the investigator to select appro¬ 
priate statistical models, parameters and scales and to generalize 
from the results 

This type of analytical cxpcnmentation with the data would 
be impossible without computers However, if the investigator will 
maintain his perspective and if each tnp to the computer leads to 
added insightand the stimulation of new questions then the authors 
have faith that these powcrfiil techniques ra p be applied to clinical 
problems and that the rules and regulations will be found for 
assistmg in diagnosis, prognostication and treatment evaluation 
with some of the ngor that oasts in other disciplmes TTie avail¬ 
able analytical models have not been exhausted m this example 
Statistical and mathematical methods hav'c required too much 
time by hand to have full application to very many of the com¬ 
plex medical problems and to be applied by physicians wlio know 
these problems Computers open the door to a much more thorough 
smdy of medical data while it is fresh in mmd and in a way that 
rap be fun as well as mfonna&vc 
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INFORMATION ACCESS 

The physician with his patient and a specific question at a 
specific time and place is the focal point of information flow in 
medicine A physician may shift his emphasis to research or 
teaching but still his main concern is the patient He stores infor¬ 
mation about the patient in the record room He may attempt 
to assemble information from similar patients from other records 
There are many similarities between the record room problem 
and the medical library problem, but this chapter is directed pn- 
manly at the storage and recovery of pertinent medical literature 

Infonnation flow is required for and closely related to creativity 
Given average intelligence and sufficient dnve, the best chance 
of being creative is to gather tc^ether, assimilate and retain as 
large a sample of the world’s knowledge pertinent to a problem 
as can be accomplished in the span of one’s attention This alone 
would justify the current activity in library methods and infor¬ 
mation science A second stimulus to the automation of our 
library systems is the exponential, inundating growth of literature 

The requirements of a successful information storage system 
might be classified as (1) a basic understanding of what informa¬ 
tion IS and a system for coding it, (2) an efficient, completely 
automatic storage system which is economical in cost and space 
and which provides a quick random access to its contents with 
facilities for duplication of any documents which are to be dis¬ 
tributed, and (3) a computer program for discnmmatory retrieval 
of stored material 

Most physicians have so far noticed little change in their habits 
of library use They continue to subscribe to a few journals Some 
physicians are subscribing to Current Contents, a iveekly report 
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of the tables of contents in their original format, reproduced from 
more than 600 foreign and Amencan journals and covering more 
than 125,000 articles by title It is possible to skim through these 
titles by the expenditure of an hour or less each week The infor¬ 
mation scientist will usually transfer those titles of interest to 
4x6 inch reference cards and these arc filed away in his personal 
filing cabinet tvhich is rapidly climbing toward the office ceiling 
As time permits, and demands require, segments of this file 
are taken to the library and the original articles skimmed or read 
m detail In a few instances repnnts arc requested by mail and 
further time is spent m filing the repnnts 

Indices such as the Index Medicus, Chemistry Abstracts and 
Biological Abstracts are occasionally consulted Additional infor¬ 
mation ts transmitted by the convergence of physicians at the 
bedside of particular patients and perhaps after the presentations 
of certain papers at national meetings 
During the >ear that this book has been m preparation there 
have appeared some key-word in-coniext indices which are the 
product of computer use and programs such as that prepared by 
H P Luhn One of the first of these was the jr47Cini«/oA'h/ro- 
chemtslTj! which includes 2100 titles of papers permuted according 
to the key words plus some added descriptive identification into 
some 15,000 alphabetically arranged entries The first entry utlc 
contains the word adrenaUne and the other key words are presented 
in sequence, but the word adrenaline is placed m the center of the 
hne This same entry shows up elsewhere with catechol shifted to 
the center and is also found under ammej and under methylaUon 
At the end of each entry is an alphabetical numeric index which 
permits the user to find the reference in a separate section where 
they arc grouped according to categories and then by alphabetical 
order of the semor author There is also an author index and an 
index for addresses by semor authors 
The key words used come from the title or are added by those 
preparing the index These words are selected for their descriptive 
or informational content of the subject matter m the paper 
This initial volume was prepared as an expenment The papers 
selected for inclusion were published between January 1960 and 
July 1961 This represents an intereting expenment from another 
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Standpoint m that it was a co-operative endea\or betiveen a 
foundation in Amsterdam and two computer centers in New York 
The idea was conceived and made a reality in the short space of 
three months Other similar uses of this technique will be men 
tioned below, butfirst a note about the current intensity in research 
and development 

The National Science Foundation has two current publications 
which serve to keep information scientists abreast of developments 
Scientific Information Notes is pubhshed every two months and 
contains information about current meetings, conferences, research 
and development, scienufic information activities abroad and 
publication notes about new journals, indices, directories and other 
publications of potential interest to the scientific community 

The second publication of the National Science Foundation of 
interest is published every six months and is titled Current Re¬ 
search and Development in Scientific Documentation (CRDSD) 
This is cumulative The last issue. Number X, includes mfoimation 
about 450 research projects being earned on m over 200 different 
organizations This represents an increase of approproximately 
100 m the last six months This report desenbes studies of scientists’ 
information needs, uses made of scienufic informauon, communi¬ 
cation problems, new methods of storage and retrieval, mechanical 
translation of languages, equipment design, studies of automated 
character recognition, speech analysis, and many related studies 
In addiuon there are presented bnef statements reviewing the 
progress made in these projects and cross references to related 
studies A glossary of special terms has been added m the last 
edition 

Another similar illustration of the magnitude of the current 
activity is a bibliography prepared by the Amencan Institute of 
Biological Sciences, edited by P C Janaske, entitled Information 
Handling and Science Informauon which covers 1121 references 
between the year 1957 and 1961 This also has a permuted mdex 
m which each title is repeated in the appropriate alphabetical 
sequence for every key ivord in the titles of papers Bnef abstracts 
are pnnted ivith each entry in the bibliography 

The ultimate objective of all of this acuvity is to make it pos¬ 
sible for an individual to obtain quickly a personalized index of 
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the available articles of interest to him, along with a personal 
file of the original paper or its translation if the onginal was m a 
different tongue from the tongue of the user 

L E Allen has suggested an approach to the cataloging of the 
entire body of wntten knowledge This has seemingly been a dream 
of mankind since the invention of svnting and was attempted early 
m the development of civilization However, even the invention 
of the printing press and the publication of large encyclopedias 
failed to keep pace with the increase in knowledge and it is only 
now with the intention of computers that this idea again seems 
feasible If it cter is attempted there iviJl still remain the “classi' 
fied” information which, according to G PicJ, already fills more 
than 100,000 file drawers in the city of Washington and at vanous 
United States military and governmental installations 

More pertinent to the problems of medicine arc the 250,000 
papers a year which are currently cataloged m the National 
Library of Medicine The focal point for information m medicine 
18 the physician as he stands in time and as he considers his role 
in society at that tune He may have just received a phone call 
about a difficult patient with an unusual disease or complication 
He is suddenly extremely interested in obtammg all of the infor¬ 
mation available which will help him provide an optimum basis 
for his decisions and actions m the care of this patient 

Later his role may change to that of an investigator, interested 
in retrieving all available information about similar patients so 
that he may add his own observations and draw his own conclusions 
and perhaps contribute some new bit of information to the store 
of knowledge This new information may arise because of his 
ability to gather together a larger «amp3e of the world’s knowledge 
related to this problem It may result from his new viewpoint 
because of the peculiar circumstances of his exposure, a new 
dimensionality in expenence 

He may be stimulated to undertake some laboratory expen- 
mentation related to this particular problem If so his informational 
requirements may shift and expend He may wish to retrieve 
information from a number <rf related fields, some of which may 
even be outside of medicine 

When he finally reports the results of his research he must 



Information Access 


123 


analyze his paper and summarize it He must gather together 
the total informational content and carefully choose words which 
will catch the eye of those informational scientists working m the 
same area This is his summary, but finally he must be clever 
enough to select words for the title of his paper which will include 
most of the information that is present m the summary 

Most practicing physicians are concerned most of the time with 
information retrieval, but they must go through some of the same 
thought processes in retneving information as the writers went 
through in coding the information in the titles of their papers 
Arnold Lazerow provided the authors an illustration of the prob¬ 
lem as It pertains to diabetes literature The National Library of 
Medicine is classifying its one quarter rmllion articles per year 
under some six or seven thousand subject heading entries Dr Izzo 
at the Center for Documentation and Communication Research 
at Western Reserve University in Cleveland has shown that by 
the use of 121 of these subject heading entries it is possible to 
retneve nearly all of the diabetes related literature, but this still 
leaves 20,000 articles a year of which only about 2500 are actually 
of interest 

A second level of remeval is being developed which will require 
specially trained people to actually read through the papers and 
determine whether they contain experimental data, whether con 
trol groups were used, whether statisucal analyses were applied 
and whether significant differences were found The articles ivill 
probably also be indexed as to their geographic scope of circulation, 
the type of audience for ivhich they seem to have been prepared 
The study at present is attempting to determine what types of 
information about articles are necessary to identify them in ways 
which will be useful to practicmg physicians who are interested 
in obtaining pertinent articles related to their problems 

A third level of retrieval is under study at the Umversity of 
Minnesota under Dr Lazerow and in this the objective is to 
develop methods of codifying papers of interest to the investigator 
In this type of retrieval selected papers will need to be abstracted 
with sufficient information that rather specific questions can be 
asked such as “Does growth hormone administration change the 
insulin content of the pancreas^” “Does growth hormone ad- 
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m.mstration affect the granule content of the beta cells of the 

’"‘?t taLn pomted out m conncctton wtth tins depth of retn^d 
n ^e reduLon tn clencal labor m obtatmng bibhogtaphtes 
* ht cncoura-e some of the more .maginative invesUgatom to 
rtS the ^tmg of revtetv amcles and that such automatton 
unde^hett a„,emmation would lead to sat mgs of time and 
tnfotwtton^d^ ,„d.v,dual Umted States Pubhe 

money r rant of S25 000 a year has been used as an lUustra 

«'‘"^S: t?Tf a lmLbdJay occurs before an tnv„ 

effort could ^ project for 

'“'I'^onlTto find that aU of this has previously been drae 

^rctr^ndencrot - gotemment to ..classffy- the 

products “f mformauon by uUe The words tn the 

At present ff,e pattern fanuhar to users 

title ate ^ 5 , 3 ,e of e,e art has been increased 

of the Ind» h appearance of B A S I 0 

imm^utably d ^ biological research published m 

which IS a key to tn ^ ^ 33 „ This sptem, designed 

Bwlcgml ® scienust's otvn desk, is publuhed eve^ 

tor use at bey word tide lisUng similar to the 

bvo weeks and “ ^"^„tonrf above The scientist uses the 
Inden on according to hu personal interests and 

bey words "1"* h necessary when current 

of course, these chang nearest library or 

papers of to find the reference and then the 

avaJable B.deped 3 , die journal 

onguialpaper^'^''“i“/d.e preparauon of B AS I C and 
Tides of “ b card for computer processing Each 

each IS entered on a pm ^ ,be hne and 

basic word is w.d, a vergule (/) martng the 

^nf*c“ S' The computer program carries out all of th 



Injonnaiion Access 


125 


arranging and pnnts out copy ready for photostatic reproduction 
and offset printing so that any required number of bound copies 
of the abstract can be prepared Words containing more than one 
idea are permuted For example, the word hemoglobin in a title 
would cause that tide to show up under hemo and also under 
globin When a word has two components but only one idea it is 
hyphenated and treated in the program as a single tvord The 
availability of this bound key to the current biological research 
IS a giant step in communication because it shifts the focus to the 
individual investigator in his oivn office and away from the local 
library ^vith its increasing inadcquaaes 

There are two obvious steps that should foUotv and which are 
under study S Zusman and R S Ledley have developed a 
biomedical information retneval program called Tabledex which 
would permit a computer production of a personalized bibliography 
in bound form for use at the worker^s desk and which would facili¬ 
tate the more rapid retneval of papers from a selected segment of 
the world’s literature 

The other development is illustrated by the Walnut System 
prepared for the Central Intelligence Agency ^vhlch makes pos¬ 
sible by compressed microimage storage and random retneval, 
access to millions of documents from wiuch copies could be auto¬ 
matically prepared and transmitted to the investigator in the form 
of windowed fihng cards, the window containing a 3 x 5 cm photo¬ 
micrograph of eight pages of printed matenal at 1/20 or up to 
1 /40 reduction Ultimately a system will undoubtedly be developed 
for immediate wire transmission of the image from storage to the 
user’s desk viewbox and there will be no need to keep a separate 
file in one’s office 

Dunng the last year the United States Department of Health, 
Education and Welfarq. Pubhc Health Service, National Institutes 
of Health, Division of Research Grants, has prepared a sample 
index of research grants that had been awarded in 1960 ivith the 
key word type of program This was prepared with modifying 
words or descrtplors which were selected by their staff This required 
considerable additional time and, therefore, the 1961 grants were 
prepared in a Research Grants Index using only the titles of the 
grants 
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There were 7000 key word headings and phrases used to alpha¬ 
betically arrange the 14,000 titles of current projects This catalog 
was then distributed to each of the grantees so that they could 
have at their desks a rapid access to information about the entire 
research effort being supported by this agency 

It was then decided that although more professional time is 
required for its preparation a greater amount of more concise 
information could be obtained by using descrtplors and these will 
be used in the 1962 index instead of the title words The descriptors 
are words taken either from title or from the information available 
about the research project by the professional personnel This then 
IS the current state of the art and it is obviously in a great state 
of flux 

The great advance in the next decade ivJl undoubtedly come 
from the application of information theory to these problems Just 
as harmonic analysis involves distinguishing rhythms or redun¬ 
dancies of varying frequency m analysis of a curve so information 
theory seeks for the basic amount of information m such diverse 
signals as language, music, color patterns or complex smells In 
a sentence there is a certain predictability of the next letter given 
one, two, three or N numbw of preceding letters There is also 
a certain predictability of word sequence, and there arc larger 
rhythms that cover paragraphs, chapters, books and even fields 
of endeavor according to the Zeitgeist 

Information theory opens a Pandora’s box Up until this time 
indices have been composed essentially of descnptive words taken 
somewhat passively from the titles of papers The levels of retneval 
mentioned in connection with the diabetes study begin to raise 
some interestiDg questions about hoiv much information there is 
in a word or a sentence Information theory tells us that these 
questions can be answered 

According- to dre tfreory, tafynaatian rcpreseacs chc frumtor <rf 
bits (binary digits) required for predicting a letter or a word, etc 
The total information has also been referred to by the term 
“entropy ” Actually man’s search is for new patterns, new redun¬ 
dancy in knowledge ivhich r^uccs the entropy 

What would an encoder need to know about a subject in order 
to determine the informational content of a new paper^ For 
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example, if the first letter in the title were T the probabihty is 
greater than 1 m 26 that the next letter will be an H If the next 
letter turned out to be something other than H this would have 
higher informational content Given two letters, TH, the prob¬ 
abihty IS high that the next letter would be an E Information 
theory has to do with rect^mtion of the patterns which reduce 
the equivocation as to the next letter or the next word or the 
next idea 

How many polar-option decisions are required to uncover all 
of the information m a title^ How would an encoder begin to 
analyze a title, “The Chmcal Significance of Serum Sodium Con¬ 
centration”’ The word of greatest convergence is sodium Chmcal 
and then serum reduce the informational content tremendously 
first by eliminating almost the entire field of pure chemistry and 
secondly by eliminating any direct concern with cell fluid, bone 
sodium, unne sodium, or other potentially important chmcal 
sodium patterns of information 

The convergence patterns m the collection of ivords chmcal serum 
sodium would require, ho\vever, that the encoder read the paper 
and determine ho^v many of these patterns are actually present 
and to what degree That there are pattern implications of the 
term clinical serum sodium coneentration is emphasized by the word 
significance m the tide Some important informational patterns 
might be summarized in some additional groupings such as “The 
rule of electrical ncutraUty” or “The rule of osmotic cquilibnum” 
or “The Gibbs-Donnan Effect” or simply “Tomcity ” These 
vanous thought patterns conveige in the term sodium^ but they do 
not necessarily diverge automatically in the mind of the average 
reader because of the great variation in traimng, in viewpoint, 
in past and present experience, id the currently available mental 
maps that exist ibr each isdiy/dusi who sees these wfords 

The encoder might, therefore, need to determine how much 
of each pattern ^vas present in the article for some theoretical 
average reader Thus there are difficult problems in encoding the 
informational content relative to redundant informaUon and even 
greater difficulty arises relative to judging the content of new 
information patterns or surprisals There is in addition the problem 
of the mcreasing information which anses as one’s viewpoint 
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changes and as the dimensions of one’s experience and observation 
increase With the publication of a new paper containing really 
Significant new information it is to be anticipated that a review 
of older papers on a subject will reveal additional information 
because of the change in dimensionality that has occurred 

There is also the problem of evaluating information If a paper 
contains a new idea, a surprise, a few bits of information that 
could not have been predicted, is it because the author is dreaming 
Without the hindrance of investigation or expenmentaiion^ Is the 
suTpTtsal reproducible’ Does the author present data’ Is there an 
adequate control study’ How sigmficant is the conclusion reached 
from a statistical vieivpomt’ Is the statistical model used appro¬ 
priate to the problem’ 

Obviously considerable empiricism must remain in the encoding 
judgments at least in the present state of the art Fortunately the 
empirical model of man’s thinking is the ever present subject of 
our deliberations and there seems to be some safety in accepting 
certain “common sense” decisions until more objective and optimal 
encoding techniques are proven Today the title of a paper plus 
some added key words, selected by specialists m the subject m 
co-operation with the author, should make possible encoding 
which will be discnmmaeivc 

It would appear that information science has a great growth 
potential, will provide many jobs requiring some shift in educa¬ 
tional preparation from what has been required in related jobs 
Tliere are undoubtedly many research projects where teams of 
scientists have come together from vanous disciplines that could 
benefit from the addition of a well trained information specialist 
This will be hard to accept since we all are information specialists, 
but as research becomes more complex we add chemists, physicists, 
electrical engineers, statisticians, systems specialists, programers, 
etc, etc, why not an information specialist’ This is not just a 
gimmick to raise a secretary’s salary They arc highly trained and 
practically non-existent In fact they probably aren’t defined, but 
that IS one of the difficulties widi progress 
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EDUCATION IN PROBLEM SOLVING 

CURRENT revolution in time-saving machines has some of 
the same implications for change that existed in the Industnal 
Revolution from labor-saving machines Machines can be used 
for storing and retneving information and making routine decisions 
The initiation of any new program along this line requires that 
very human quality, creative thought 
Time-saving machines can relieve man of his last remaining 
drudgery so that he may be really free to be creative and to per- 
foirn those purely human activities which machines cannot provide 
If creative thinking is to be a common commodity there wiIJ 
need to be a never-ending and intensive educational program at 
least for those members of society who are willing to participate 
m the current time saving revolution The development of auto¬ 
mation 18 well established, accelerating in pace, irreversible and 
a barometer of an advanced, healthy, modem society The choice 
IS not \vhether to develop computer techniques m our time The 
only choice is an individual one, namely, whether to participate 
Evolution has always been difficult for some members of society 
but never before has the challenge and the opportunity been so 
great for the individual Today the individual can, through 
properly channeled hard work and education, reap the benefits 
of automation whether he is wth or without status or nches 
It is becoming increasingly apparent that man’s most precious 
commodity is his own time With free tune an individual may 
develop his information handlmg ability He may increasingly 
control his destiny by having control over his environment and 
himself Modem man seeks pmver to know and to make reliable 
decisions As long as he avoids the pitfall of subverting these drives 
129 
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to controlling his fellow men, today’s citizen and society are 
increasingly fortunate 

The challenge for physicians is to make the practice of medicine 
ever more scientific and at the same time more humanly attractive 
These two goals are more than compatible They are supple 
mentary Practicing phj^icians must seek to attain as much of the 
logical, mathematical and statistical education as can be accom¬ 
plished in their busy lifetimes They must introduce these quanti¬ 
tative and objective approaches with increasing emphasis in 
current medical education and they must also encourage the 
changes which are occurring in undergraduate education and 
which seem to be desirable in preparing students for a life of 
change and adaptability 

The great artist must learn something about the medium in 
which he wishes to work The physician who wishes to really 
understand, and especially if he wishes to contribute to the under 
standing of his colleagues, must, like the artist, determine what 
medium is best suited to his problems and then must learn as 
much as is necessary about his chosen tools The difference between 
science and art is one of emphasis Physicians share in man’s 
tendency to dichotomize liis problems and ignore the more 
difilcult portion 

Experience, kindliness, morality and empathy are consistent 
ivith objectivity, knowledge, hard work, intellectual curiosity and 
a thorough understanding and use of the scientific method in the 
practice of medicine The seriously ill patient needs both the art 
and the science of medicine The practicing physician ivho is 
provided ^vltl^ great insight and the assurance of a well rounded 
decision making potentiality will be better able to communicate 
the art of medicine to his patient llie earlier chapters of this 
book have attempted to bring logcrticr a sample of the changes 
which are occumng m medical practice related to our new ability 

tscode jJifojToatJOX} and jraampidate it wjth tbe ^oeed of eJec- 
tncity and in prodigious quantity and complexity 

Mathematical techniques have been developing mainly on a 
theoretical level for centuries At the same time mountains of facts 
about biology, medicine and the problems of individual patients 
have continued to accumulate The catalyst that makes possible 
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the solution of medical problems and that also makes possible the 
application of mathematics to orgamzation of these facts in this 
solution, IS ike stored program digital computer This is a third and 
new dimension which not only establishes a tnangle in our view 
but even makes more visible the line which exists betvseen medicine 
and mathematics 

Here then is the gauntlet The computer is a very complex 
tool which needs some study This is the easiest part of the prob¬ 
lem, however There are those who can help with the use of the 
tool providing a physician knows his problem and provided he 
can familanze himself \vith the pertinent mathematical techniques 

There arc many relatively simple problems that require little 
more than grade school arithmetic As a physiaan works on his 
chosen problem, however, he wll usually soon be led into algebra 
Depending upon the problem, he will then begin to see the need 
for symbolic logic or probability and statistics and possibly some 
speaal techmque m numerical analysts or the calculus 

Each step m the learmng process increases one’s power of 
creativity The mathematical techmques are very powerful tools 
but without the computers and application by ^owledgeable 
individuals these tools he dormant For the practicing physician 
the required area of study is applied mathematics Most physiaans 
will quickly lose their ivay if they attempt to become theoretical 
mathematicians before they attempt to make any use of mathe¬ 
matical skills 

Modem algebra such as the recent course presented on tele¬ 
vision in Continental Classroom is a basic requirement and should 
help the physician to understand such things as numbers, lines, 
sets, vectors, matnees and determinants One of the biggest deter¬ 
rents to the study and understanding of mathematics is notation 
There is no point in continuing to read beyond one’s under¬ 
standing of the symbols Such notation as the symbols for sum¬ 
mation, subscripts, functions, sets, etc must be learned as they 
initially appear in the text of any book 

Multiple texts may contribute to understanding as long as the 
notation is similar and need not contribute to the confusion if 
careful watch is made for the differences m notation Programed 
learning would seem to be a very natural solution to the problem 
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of self-help m developing mathematical skills for the medical 
profession This is a companion to computer programing and 
should be watched for, encouraged and made use of to its maximum 
in medical education 

After modern algebra the physician’s needs will diverge but 
not completely nor permanently The initial study of probability 
and statistics may seem completely different from the study of 
calculus or symbolic logic, but as the study, and especially the 
application develops, there is increasing pressure to satisfy needs 
in other mathematical methods 

For the diagnostician symbolic logic or Boohan algebra will 
seem attractive because it facilitates encoding of information and 
the sorting and decision making that is required m analyzing 
diagnostic decision processes 

The clinical investigator may find his initial interest greatest 
m statistics He wants to know the sample average, range and 
distnbution and the tests for significance of results He is interested 
in experimental design and the types of analytical problems that 
are illustrated in Chapter VI The basic scientist and clinician 
find a common area of interest in the use of statistics 
Vector algebra, matrices and determinants become increasingly 
useful as the physician progresses m statistical applications and 
the more complicated experimental designs, factor analysis and 
the many varieties of least square analysis 

Sooner or later it becomes apparent that the world is very 
nonlinear, that distnbutions are skewed and that more versatile 
approaches are required Relationships and change become of 
sufficient interest as to demand famihanty with differential 
equations, calculus and special techniques of curve analysis 
The clinician who is in practice may not feel the need for 
education in mathematics unless he has one or more specific prob¬ 
lems which he is interested in trying to solve Given the stimulation 
of a proWenr and a I’tidc gmdaeice m the skilh pcqaaed 

for solution of that problem, many physicians are picking up the 
necessary information even though the past mathematical con¬ 
tent of premedical and medical education has been appallingly 
dilute 

Clinicians have more obligation to the revolution in automation 
than simply learning more mathematics Physicians are custodians 
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of the most important biological questions of our time It is neces¬ 
sary for physicians to remain involved m medical research as 
active partiapants Physiaans must recognise that every individual, 
regardless of his traimng, remains ignorant and incompetent m 
many areas outside of his own It is necessary to establish and 
maintain the necessary working relationships to solve these prob¬ 
lems Physicians must be ready to cross over at least into the 
fringes of other disciplines and must extend a helpful and a ready 
hand to those willing to walk on the medical grounds No physician 
can abdicate his responsibility for the advancement of his own 
field simply because of the difiiculues imposed by the very progress 
taking place in his own and related fields 

The imtial penod of these new developments will be most 
exasperai^ngtecaase^trmjtttfersomtfc^ fitnc to efevefop programs 
and to instigate the use of these time saving devices However, 
computers, by performing routine tasks, will make more time avail 
able This wll lead to other problems which physicians ivill be 
required to study 

Physicians will need to help plan for tlie use of time of those 
less fortunate individuals whose frec-time becomes a burden in¬ 
stead of a gift of automation TTiere is a>gument as to whether 
automation will make more jobs than it eliminates Certainly it 
Will create problems in adjustment for those unable to adapt to 
change and it will undoubtedly create some new medical problems 
There are also social implications of automation ivhich physicians 
need to prepare for, such as the maintenance of m 2 iximum freedom 
of medical practice and the maintenance of optimum patient 
physician relationships 

Because of the increasing rapidity of change it is now more 
important that physicians look closely at tlie educational needs of 
the generation of medical students entenng medical school as well 
as those successors who are m the lower grades or who will soon 
be entenng school for the first time Are th<“re any real differences 
in the early educational needs for physicians from the needs of 
other professional and nonprofcssional men> What are the current 
trends and the educational requirements generally^ What will be 
the world, community, medical and personal problems of to¬ 
morrow's physicians^ How should tomorrow’s physician be edu¬ 
cated today'^ 
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